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I.  INTRODUCTION 

Near  infrared  (NIR)  diffuse  optical  imaging  provides  quantitative  functional  information  from  breast  tissue  that 
can  not  be  obtained  by  conventional  radiological  methods.  NIR  techniques  can  provide  in  vivo  measurements 
of  oxygenation  and  vascularization  state,  the  uptake  and  release  of  molecular  contrast  agents  and  chromophore 
concentrations  with  high  sensitivity.  There  is  considerable  evidence  that  tumor  growth  is  dependent  on  angio¬ 
genesis  [1]-  [3],  and  that  tumor  aggressiveness  can  be  assessed  from  its  increased  number  of  new  vessels  and 
reduced  oxygenation  state  relative  to  normal  breast  tissue  and  benign  breast  lesions  [4]-  [6].  NIR  diffuse  optical 
tomographic  (DOT)  methods  has  the  potential  to  characterize  angiogenesis  related  vessel  density  as  it  measures 
the  total  hemoglobin  concentration  and  provide  the  ability  to  differentiate  between  benign  and  malignant  lesions 
based  on  oxygen  saturation.  Furthermore,  NIR  methods  are  non-ionizing,  relatively  inexpensive  and  can  be  made 
portable. 

The  diagnosis  and  management  of  cancer  involves  several  stages  where  magnetic  resonance  (MR)  plays  a 
valuable  and  growing  role.  MRI  of  the  breast  is  now  a  routine  part  of  the  clinical  care  in  many  centers  [9]- 
[11].  Magnetic  Resonance  imaging  (MRI)  is  indicated  in  patients  with  inconclusive  clinical  and/or  mammographic 
examinations.  Patients  that  may  bene  t  include  women  with  radiographically  dense  breasts,  and  high  risk  potential 
population  [12]-  [13].  MRI  possesses  less  than  10%  contrast  for  soft  tissue  pathology  [14],  Gadolinium  (Gd) 
enhanced  MRI  offers  much  better  contrast  and  is  speci  c  for  tumor  vessel  imaging.  However,  the  signal  in  the 
Gd-MRI  arises  from  the  larger  vessels  as  the  contrast  agent  is  ushed  out  of  the  vascular  bed  of  the  tumor  [15]. 
In  comparison,  NIR  measurements  of  absorption  have  extremely  high  contrast.  It  was  reported  that  5%  change 
in  vascular  density  as  measured  histologically  in  ductal  carcinomas  leads  to  approximately  300%  contrast  in  NIR 
absoiption  coef  cients  [7].  Furthermore,  there  are  studies  suggesting  that  the  kinetics  of  contrast  enhanced  optical 
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spectroscopy  provides  information  about  the  cellular  spaces  [8].  On  the  other  hand,  NIR  DOT  suffers  from  poor 
spatial  resolution  and  as  such,  it  is  unlikely  that  NIR  imaging  will  be  a  stand-alone  screening  method  in  the  general 
population.  Therefore,  we  believe  that  the  concurrent  MR  and  NIR  imaging  brings  together  the  most  advantageous 
aspects  of  the  two  imaging  modalities  (structural  and  functional).  In  the  future,  we  envision  that  this  multimodality 
imaging  approach  will  lead  to  high  resolution  hemoglobin  tomography  and  comprehensive  quantitative  functional 
tissue  characterization  to  differentiate  malignant  and  benign  tumors. 

In  this  project,  the  clinical  studies  are  performed  using  the  novel  MR-NIR  hybrid  time-resolved  spectroscopy 
(TRS)  imager  and  fast  Indocynine  Green  (ICG)  enhanced  spectroscopic  imager  developed  by  Dr.  Chance,  a  Co-PI 
of  this  proposal,  at  the  University  of  Pennsylvania  (UPenn),  Biophysics  Department,  Diffuse  Optical  Imaging  and 
Spectroscopy  Laboratory. 

The  central  hypothesis  of  this  project  is  that  the  concurrent  MR-NIR  diffuse  optical  tomographic  methods  coupled 
with  fast  contrast  enhanced  NIR  spectroscopic  methods  provide  fundamentally  new  quantitative  functional  and 
structural  information  for  breast  cancer  tumor  characterization  and  detection.  This  new  information  can  be  obtained 
by  novel  modeling,  analysis  and  data  fusion  methods  from  the  tomographic,  temporal  and  cellular-based  contrast 
measurements,  which  exploit  fast  imaging  techniques  together  with  TRS  tomographic  methods.  In  this  project, 
we  investigate  new  methods  for  multi-modality  high  spatial  resolution  hemoglobin  tomography,  pharmacokinetic 
modeling  of  molecular  contrast  agents  based  on  fast  NIR  spectroscopy  and  analysis  of  structural  and  functional 
information  provided  by  MR  and  NIR  imaging  methods  for  breast  cancer  detection  based  on  receiver  operating 
characteristics  methodology.  Speci  c  aims  of  the  project  are  as  follows: 

•  Aim  1 :  Utilize  a  priori  anatomical  information  provided  by  MRI,  to  reconstruct  3D  high  resolution  hemoglobin, 
water  and  lipid  concentration,  and  oxygen  saturation  images  directly  from  6  wavelength  time  resolved  optical 
measurements.  Evaluate  improvements  in  image  reconstruction  between  that  of  stand-alone  NIR  and  concurrent 
MR-NIR  measurements  using  water  and  lipid  images  obtained  from  MRI. 

•  Aim  2:  Develop  a  compartmentalized  pharmacokinetic  modeling  of  ICG,  optical  contrast  agent,  and  extract 
quantitative  parameters  that  can  characterize  tumor  metabolism  and  angiogenesis.  Compare  ICG  kinetics  with 
the  Gadolinium,  MR  contrast  agent,  kinetics  and  biopsy  ndings. 

•  Aim  3:  Evaluate  accuracy  of  breast  cancer  diagnosis  based  on  the  quantitative  functional  information  extracted 
from  stand-alone  NIR  system.  This  information  includes  hemoglobin,  water  and  lipid  concentration,  optical 
scatter  power  and  oxygen  saturation  images,  and  ICG  pharmacokinetic  parameters.  Evaluate  the  added  value 
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of  ICG  kinetic  parameters  in  breast  cancer  diagnosis. 

•  Aim  4:  Combine  NIR  based  breast  cancer  diagnosis  features  with  the  systematic  MR  breast  architecture  and 
kinetics  interpretation  model  developed  by  Dr.  Nunes,  M.D,  Co-PI  of  this  proposal,  to  evaluate  the  sensitivity 
and  speci  city  of  concurrent  MR-NIR  imaging  method.  Compare  results  with  that  of  stand-alone  MR  and  NIR 
results. 

In  the  following  sections,  we  will  provide  detailed  description  of  our  current  research  in  line  with  the  statement 
of  work  (SOW)  and  the  aims  outlined  above.  For  the  period  of  June  1st,  2006  to  May  31st  2007,  SOW  includes 
only  the  rst  two  aims  of  the  project. 


II.  BODY 

In  the  year  3  of  the  project,  SOW  includes  tasks  related  to  Aim  2  and  3  of  the  project.  Below,  we  describe  the 
works  that  has  been  performed  towards  the  Aim  2,  Task  3  and  Aim  3  Tasks  3  and  4.  We  also  improved  upon  the 
Task  1  of  Aim  1. 

A.  AIM  2  -  Tasks  3  /  AIM  3-  Task  I 

1 )  Reconstruction  of  Bulk  ICG  Concentration  Images:  In  our  data  collection  process,  a  sequence  of  boundary 
measurements  are  collected  over  a  period  of  time.  Each  set  of  measurements  are  used  to  form  a  frame  of  the  ICG 
concentration  images.  The  resulting  sequence  of  ICG  concentration  images  are  then  used  to  form  pharmacokinetic- 
rate  images.  To  reconstruct  each  frame  of  the  ICG  concentration  images,  we  follow  a  static  reconstruction  approach 
and  use  differential  diffuse  optical  tomography  (DDOT)  technique  [18],  [23]. 

In  DDOT,  two  sets  of  excitation  measurements  are  collected  corresponding  to  before  and  after  the  ICG  injection, 
and  the  ICG  concentration  is  determined  by  the  perturbation  method  [18],  [23].  The  photon  propagation  before  and 
after  the  injection  is  modeled  by  the  following  diffusion  equation: 

V-[DxV<Z>±(r,u)-[fj,±x(r)  +  ju;/c}<Z>±(r,u)  =  0,  r  G  D  C  M3  (1) 

with  Robin- type  boundary  conditions: 

2Dx^  +  P®t  =  ~S(r,  w),  r  G  d(l.  (2) 

where  x  stands  for  the  excitation,  c  is  the  speed  of  light  inside  the  medium  Q;  w  denotes  the  modulation  frequency 
of  the  source,  fifx(r)  and  /x+.(r)  are  the  absorption  coefcients  before  and  after  the  ICG  injection,  Dx  is  the 
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diffusion  coefcient  which  is  assumed  to  be  constant,  <f>^  (r,  uj)  optical  eld  at  location  r  before  and  after  the  ICG 
injection.  Here,  v  denotes  the  outward  normal  to  the  boundary  Oil  of  Q,  p  is  a  constant  representing  the  refractive 
index  mismatch  between  the  two  regions  separated  by  dQ,  and  S(r,LO )  is  the  excitation  source  on  the  boundary. 

The  absorption  coef  cient  after  the  injection  //+.  are  modeled  as  a  sum  of  the  absorption  coef  cient  of  the 
medium  before  the  ICG  injection  /j,~x  and  the  perturbation  caused  by  the  ICG  A/iax(r ): 

A Hax(r)  =  ntx(r)  -  n~x(r),  r  e  H  C  M3.  (3) 

In  the  forward  model,  the  analytical  solutions  of  the  heterogonous  diffusion  equation  given  in  (1)  is  derived  using 
rst  order  Rytov  approximation  [18].  The  sample  volume  is  divided  into  a  set  of  voxels  and  the  measurements 
are  related  to  the  relative  absorption  coef  cients  of  each  voxel  by  a  system  of  linear  equations.  The  shape  of  the 
breast  was  approximated  as  a  cylinder  and  the  Kirchhoff  approximation  [24],  [25]  for  diffuse  waves  was  used  to 
model  the  interaction  of  light  with  boundaries.  In  order  to  account  for  the  biological  noise,  the  forward  model  was 
implemented  with  coupling  coef  cient  technique  [26]. 

Here,  the  Rytov-type  measurements,  which  are  de  ned  by  the  natural  logarithm  of  the  ratio  of  the  post-ICG 
measurements  to  the  pre-ICG  measurements  were  used  [23].  Let  ^x(rd,oj;  rs)  denote  the  Rytov-type  measurements 
at  location  rd  due  to  source  at  rs.  The  linearized  relationship  between  the  differential  absorption  coef  cient  and 
measurements  is  given  by  [21], 

Vx(rd,w,ra)  = - — - -  G~{r  —  rd,  uj;  r s)Ix(r)&~  (r,  uj;  r s)d3r  (4) 

(rd,uj;rs)  Jn 

where  3>x(r,  uj;  rs)  is  the  photon  density  obtained  at  the  excitation  wavelength  before  ICG  injection,  Ix(r )  = 
cA/iax(r) / Dx,  and  G~(r  —  rd,  uj;  rs)  is  the  Green’s  function  of  (1)  for  a  source  at  rs  before  the  injection  describing 
the  propagation  of  light  from  the  heterogeneity  r  to  the  detector  at  rd. 

We  address  the  inverse  problem  of  recovering  A/jax  from  Rytov-measurements  based  on  the  forward  model 
(4)  using  the  singular  value  decomposition  of  the  Moore-Penrose  generalized  system.  We  use  a  zeroth-order 
Tikhonov  regularization  to  stabilize  the  inversion  procedure.  We  apply  the  L-curve  method  to  an  experimental 
model  reconstruction  and  derive  the  best  regularization  parameter  using  a  curvature  function  as  described  in  [27], 
A  detailed  discussion  of  the  forward  and  inverse  models  used  for  the  reconstruction  of  differential  absoiption 
coef  cients  (A/j,ax)  can  be  found  in  [18]. 

To  construct  a  set  of  ICG  concentration  images,  we  use  the  linear  relationship  between  the  differential  absorption 


coefcients  and  ICG  concentrations  [20]: 


A pa(r)  =  In  10  e\  m(r)  =  2.303  e\  m(r ) 


(5) 


where  is  the  extinction  coefcient  of  ICG  at  the  wavelength  805nm,  m(r)  is  the  bulk  ICG  concentration  in  the 
tissue,  and  A pa(f)  is  as  de  ned  in  (3). 

2)  Modeling  and  Estimation  of  ICG  Pharmacokinetics:  Two-compartment  Model  of  ICG  Pharmacokinetics'. 
Using  the  method  outlined  in  Section  2,  we  reconstruct  a  sequence  of  ICG  concentration  images.  As  an  example, 
Figure  3  shows  a  set  of  images  reconstructed  from  in  vivo  breast  data. 

Our  objective  is  to  model  the  pharmacokinetics  of  ICG  at  each  voxel  of  ICG  concentration  images  using 
compartmental  modeling.  To  do  so,  we  rst  extracted  the  time  varying  ICG  concentration  curves  for  each  voxel 
from  the  sequence  of  ICG  concentration  images.  An  example  of  such  a  curve  is  shown  in  Figure  6.  We  next  t  a 
two-compartment  model  to  each  ICG  concentration  curve  [19],  [22],  Figure  23  shows  the  two-compartment  model 
for  ICG  kinetics.  Here,  plasma  and  extracellular-extravascular  space  (EES)  constitute  the  two  compartments.  Cp 
and  Ce  represent  the  ICG  concentrations  in  plasma  and  the  EES,  respectively.  The  rates  km  and  kout  govern  the 
leakage  into  and  the  drainage  out  of  the  EES.  The  parameter  keim  describes  the  ICG  elimination  from  the  body 
through  kidneys  and  livers. 

Using  the  two-compartment  model  introduced  in  [19],  ICG  transition  between  plasma  and  the  EES  can  be 
modeled  as  follows: 

Ce(t) 

Cp(t ) 


kout  k{n 

kout  ( kin  T  ^eim) 


ce{f) 

Cp(t ) 

+  w(i), 


(6) 


where  Cp(t)  and  Ce(t)  represent  the  ICG  concentrations  in  plasma  and  the  EES  at  time  t  <E  [To,  Ti],  respectively. 
The  rates  kin,  kout,  and  keim  have  a  unit  of  sec-1.  They  are  de  ned  as  the  permeability  surface  area  products 
given  by  PSp,  where  P  is  the  capillary  permeability  constant,  S  is  the  capillary  surface  area,  and  p  is  the  tissue 
density.  Here,  u(t)  is  uncorrelated  zero-mean  Gaussian  process  with  covariance  matrix  Q  representing  the  model 
mismatch. 

The  actual  total  ICG  concentration  in  the  tissue  is  a  linear  combination  of  plasma  and  the  EES  ICG  concentrations 
modeled  as: 


m(t) 


Ve  Vp 


Ce(t ) 
Cp(t) 


+  r)(t ) 


(7) 


9 


where  m(t),  Ce(t),  and  Cp(t)  are  de  ned  in  (5)  and  (6);  vp  and  ve  are  plasma  and  the  EES  volume  fractions, 
respectively;  and  77(f)  is  uncorrelated  zero-mean  Gaussian  process  with  covariance  matrix  R,  representing  the 
measurement  noise. 

Estimation  of  ICG  Pharmacokinetics  using  Extended  Kalman  Filtering:  In  matrix-vector  notation,  (6)  and  (7) 
can  be  expressed  as: 

C(f)  =  K(a)C(f)  +  w(f),  (8) 

m(t)  =  V  (a)C(f)  +  77(f), 


where  C(t)  denotes  the  concentration  vector  with  elements  Ce(t),  and  Cp(t );  K(a)  is  the  system  matrix,  V(a)  is 
the  measurement  matrix  as  de  ned  in  equation  (7),  and  a  is  the  parameter  vector  given  by 

®  —  \fout  kin  kflrn  Ve  Vp\  .  (9) 


The  ICG  measurements  in  (8)  are  collected  at  discrete  time  instances,  f  =  kT,  k  =  0,1,...,  where  T  is  the 
sampling  period.  Therefore,  the  continuous  model  described  in  (8)  is  discretized.  We  can  express  the  discrete 
compartmental  model  as  follows: 

Cd(k  +  l)  =  Kd(d)Cd(k)  +  ud(k),  (10) 

m  (k)  =  \d(0)Cd{k)  +  r]d(k ), 


where  Kd(6)  =  eK  -  is  the  discrete  time  system  matrix;  \d(0)  =  \(a)  is  the  discrete  measurement  matrix;  u>d(k) 
and  r}d(k)  are  zero-mean  Gaussian  white  noise  processes  with  covariances  matrix  Qd  and  variance  R,/.  respectively. 
The  vector  6  is  composed  of  parameters  Ttj  which  are  functions  the  pharmacokinetic-rates  and  volume  fractions: 


e  = 


(ii) 


Til  T12  T21  r22  Ve  Vp 

We  rst  estimate  Ttfs,  i.  j  =  1,2  and  then  compute  the  pharmacokinetic -rates  kin,  kout  and  ke,irn  [19],  [28].  The 
explicit  form  of  the  discrete  state-space  model  is  given  as  follows: 


ce(k  +  1) 

Til  T12 

Ce(k) 

cp(k  +  1) 

T21  T22 

cp{k) 

+  vd(k) 


(12) 


m{k)  = 


Vp.  Vn 


Ce(k) 

Cp(k) 


+  Vd(k). 


We  estimate  the  parameter  vector  6  and  concentration  vector  by  using  the  EKF  framework.  The  EKF  is 
a  recursive  modeling  and  estimation  method  with  numerous  advantages  in  ICG  pharmacokinetic  modeling  [19]. 
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These  include  effective  modeling  of  multiple  compartments,  and  multiple  measurement  systems  in  the  presence 
of  measurement  noise  and  uncertainties  in  the  compartmental  model  dynamics,  simultaneous  estimation  of  model 
parameters  and  ICG  concentrations  in  each  compartment,  statistical  validation  of  estimated  concentrations  and 
error  bounds  on  the  model  parameter  estimates,  and  incorporation  of  available  a  priori  information  about  the  initial 
conditions  of  the  permeability  rates  into  the  estimation  procedure. 

When  both  states  (ICG  concentrations)  and  model  parameters  (pharmacokinetic-rates  and  volume  fractions)  are 
unknown,  a  linear  state-space  model  can  be  regarded  as  a  non-linear  model;  the  linear  system  parameters  and  states 
combine  to  form  the  new  states  of  the  non-linear  model.  This  system  is  then  linearized  and  the  new  unknown  states 
are  found  using  the  EKF  estimator  [19],  [29]— [3 1  ].  In  EKF  framework,  6  can  be  treated  as  a  random  process  with 
the  following  model: 

e(k  +  i)  =  e(k)  +  u(k),  (13) 

where  $d{k)  is  a  zero-mean  Gaussian  process  with  covariance  matrix  S^. 

Table  1  summarizes  the  joint  estimation  of  pharmacokinetic-rates  and  ICG  concentration  in  different  compart¬ 
ments.  In  Table  1,  Cd(k\k  —  1)  is  the  state  estimate  propagation  at  step  k  given  all  the  measurements  up  to  step 
k  —  1;  C d{k)  is  the  state  estimate  update  at  step  k:  Pk,k-i  denotes  the  error  covariance  propagation  at  step  k  given 
all  the  measurements  up  to  step  k  —  1;  P k,k  is  the  error  covariance  update  at  step  k;  S,/  is  the  preassigned  covariance 
matrix  of  Sd(k);  ,J /,  is  the  Jacobian  matrix  due  to  iterative  linearization  of  the  state  equation  at  step  k;  G/,.  is  the 
recursive  Kalman  gain  at  step  k;  R,/  is  the  covariance  matrix  of  the  measurements;  Qd  is  the  covariance  matrix  of 
the  concentration  vector;  and  I  is  the  identity  matrix.  A  detailed  discussion  of  the  Kalman  ltering  algorithm  for 
the  joint  estimation  of  the  model  parameters  and  ICG  concentrations  in  different  compartments  can  be  found  in 
[19]. 

3)  Clinical  Results  -  Apparatus:  :  In  this  work,  we  use  the  data  collected  with  a  continuous  wave  (CW)  NIR 
imaging  apparatus.  The  apparatus  has  16  light  sources,  which  are  tungsten  bulbs  with  less  than  1  watt  of  output 
power.  They  are  located  on  a  circular  holder  at  an  equal  distance  from  each  other  with  22.5  degrees  apart.  Sixteen 
detectors,  namely,  silicon  photodiodes,  are  situated  in  the  same  plane.  The  breast  is  arranged  in  a  pendular  geometry 
with  the  source-detector  probes  gently  touching  its  surface.  Figure  2  illustrates  the  con  guration  of  the  apparatus 
and  the  con  guration  of  the  detectors  and  the  sources  in  a  circular  plane.  Note  that  sources  and  detectors  are 
co-located.  The  detectors  use  the  same  positions  as  the  sources  to  collect  the  light  originating  from  one  source  at 
a  time.  Only  the  signals  from  the  farthest  1 1  detectors  are  used  in  the  analysis.  For  example,  when  Source  1  is 
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on,  the  data  is  collected  using  Detectors  4  to  14.  A  band  pass  Iter  at  805  nm,  the  absorption  peak  of  ICG,  is 
placed  in  front  of  the  sources  to  select  the  desired  wavelength.  A  set  of  data  for  one  source  is  collected  every  ~ 
500  ms.  The  total  time  for  a  whole  scan  of  the  breast  including  16  sources  and  16  detectors  is  ~  8.8  seconds.  A 
more  detailed  explanation  of  the  apparatus  and  the  data  collection  procedure  can  be  found  in  [32]. 

Protocol.  Patients  with  suspicious  breast  tumors  are  enrolled  for  this  study.  ICG  is  injected  intravenously  by 
bolus  with  a  concentration  of  0.25  mg  per  kg  of  body  weight.  Diagnostic  information  is  obtained  using  biopsy 
results.  Since  biopsy  modi  es  the  blood  volume  and  blood  o  w  around  the  tumor  region,  measurements  are  made 
prior  to  biopsy.  Data  acquisition  started  before  the  injection  of  ICG  and  continued  for  10  minutes. 

Tumor  Information'.  Three  different  patients  with  different  tumor  types  are  included  in  this  study.  First  case, 
(Case  1),  is  broadenoma,  which  corresponds  to  a  mass  estimated  to  be  1—2  cm  in  diameter  within  a  breast  of  9 
cm  diameter  located  at  6-7  o’clock.  Second  case,  (Case  2),  is  adenocarcinoma  corresponding  to  a  tumor  estimated 
to  be  2—3  cm  in  diameter  within  a  breast  of  7.7  cm  diameter  located  at  4-5  o’clock.  The  third  case,  (Case  3),  is 
invasive  ductal  carcinoma,  which  corresponds  to  a  mass  estimated  to  be  4  by  3  cm  located  at  6  o’clock.  Table  2 
describes  the  tumor  information  for  each  patient. 

4)  ICG  Concentration  Measurements  for  Pharmacokinetic  Parameter  Estimations:  Using  the  CW  imager  de¬ 
scribed  above,  suf  cient  number  of  source-detector  readings  are  collected  from  different  angles  for  each  patient. 
Differential  absorption  images  are  reconstructed  based  on  DDOT  forward  model  given  in  (4).  Using  the  linear 
relationship  (5)  between  ICG  concentration,  and  absorption  coef  cient,  ICG  concentration  images  are  obtained  for 
each  case.  A  sample  set  of  ICG  concentration  images  for  the  selected  time  instants  are  shown  in  Figure  3,  4, 
and  5  for  Case  1,  2,  and  3,  respectively.  Although  only  9  images  are  displayed,  there  are  around  50  images  for 
each  case,  each  corresponding  to  a  different  time  instant.  Each  image  is  composed  of  649  voxels.  Note  that  the 
ICG  concentration  images  in  Figures  3,  4,  and  5  represent  the  bulk  ICG  concentrations  in  the  tissue,  not  the  ICG 
concentrations  in  plasma  or  in  the  EES  compartments. 

We  next  extract  the  time  course  of  ICG  concentration  for  each  voxel.  As  an  example,  Figure  6  shows  the  time 
course  of  ICG  concentrations  for  all  three  cases  for  a  speci  c  voxel  around  the  tumor  region  (65th,  216th,  188^ 
voxel  for  Case  1,  Case  2,  and  Case  3,  respectively).  We  then  t  the  two-compartment  model  to  each  time  course 
data  using  the  EKF  framework;  and  estimate  kin,  kout,  keim,  and  the  ICG  concentrations  in  plasma  and  EES. 
Spatially  resolved  images  of  kin,  and  kout  for  each  case  are  shown  in  Figures  8(a)-(b),  and  9(a)-(b),  10(a)-(b), 
respectively.  Additionally,  we  construct  ICG  concentration  images  for  plasma  and  the  EES  compartments.  Figures 
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11-16  show  the  ICG  concentration  in  plasma  and  the  EES  for  3  different  time  instants  for  Case  1,  2,  and  3, 
respectively.  Our  results  show  that  the  pharmacokinetic-rates  are  higher  around  the  tumor  region  agreeing  with  the 
fact  that  permeability  increases  around  the  tumor  region  due  to  compromised  capillaries  of  cancer  vessels.  We  also 
observed  that  ICG  concentrations  in  plasma  and  the  EES  compartments  are  higher  around  the  tumors  agreeing  with 
the  hypothesis  that  around  the  tumor  region  ICG  may  act  as  a  diffusible  extravascular  o  w  in  leaky  capillary  of 
cancer  vessels. 

Using  the  brightest  spot  as  a  reference  point,  the  a  priori  information  on  the  location,  and  the  size  of  the  tumors, 
we  plot  an  ellipse  (or  a  circle)  to  identify  the  approximate  location  and  size  of  the  tumor  in  the  pharmacokinetic -rate 
images.  Figures  17(a),  (b),  and  (c)  present  the  km  images  with  approximate  tumor  location  and  size  for  Case  1,  2, 
and  3,  respectively.  The  consistency  of  the  bright  regions  in  the  kin  images,  and  circular/elliptical  regions  drawn 
based  on  the  a  priori  information  shows  that  the  pharmacokinetic-rate  images  provide  good  localization  of  tumors 
and  size  information. 

The  histograms  of  kin  and  kout  rates  from  inside  and  outside  the  tumor  region  (as  indicated  by  circular/elliptical 
regions)  are  shown  in  Figures  18(a),  (b),  and  (c),  and  Figures  19(a),  (b),  and  (c),  respectively.  The  solid  curves 
in  Figures  18  and  19  show  the  Gaussian  t.  Clearly,  pharmacokinetic-rate  images  facilitates  analysis  of  the  spatial 
variation  of  the  pharmacokinetic-rates  for  breast  cancer  screening,  diagnosis,  and  staging.  The  histograms  and  their 
Gaussian  probability  density  function  ts  in  Figures  18  and  19  show  that  the  mean  and  the  standard  deviation  of 
kin  and  kout  values  are  different  inside  and  outside  the  tumor  region.  Table  3  tabulates  the  mean  values  (±  spatial 
standard  deviation)  of  the  pharmacokinetic-rates  from  inside  and  outside  the  tumor  regions  for  all  three  cases.  The 
total  error  (probability  of  false  positive  and  false  negative)  using  the  Bayesian  minimum  error  classi  er  is  given  by 
the  overlapping  area  between  the  two  Gaussian  ts  [33].  Table  4  tabulates  the  total  Bayesian  error.  These  small  error 
values  show  that  the  spatially  resolved  pharmacokinetics-rate  analysis  provide  good  sensitivity  and  speci  city  for 
diagnosis.  One  other  advantage  of  the  pharmacokinetic-rate  images  is  that  they  allow  comparison  of  rates  obtained 
from  healthy  and  tumor  tissue  for  the  patient  itself  since  absolute  (mean)  values  may  be  quite  different  from  patient 
to  patient.  For  example,  the  kin  rate  for  Case  3  obtained  outside  the  tumor  region  is  higher  than  the  kin  rate  for 
Case  1  obtained  inside  the  tumor.  Thus,  using  the  pharmacokinetic-rates  obtained  from  healthy  tissue  as  a  reference 
to  the  tumor  tissue  for  the  same  patient  can  improve  diagnostic  accuracy.  Flowever,  very  high  mean  values  of  kin 
and  kout  may  be  indicative  of  the  severity  of  malignancy.  The  pharmacokinetic  rates  are  higher  for  Case  3  (invasive 
ductal  carcinoma),  for  both  inside  and  outside  the  tumor  region  as  compared  to  Case  2  (adenocarcinoma).  Similarly, 
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the  kinetic  rates  are  higher  for  Case  2  (adenocarcinoma),  as  compared  to  Case  1  (  broadenoma)  for  both  inside 
and  outside  the  tumor  region. 

To  further  understand  the  value  of  spatially  resolved  pharmacokinetic-rate  analysis  as  compared  to  the  bulk 
pharmacokinetic-rates,  we  averaged  the  concentration  images  spatially,  and  obtained  a  bulk  concentration  value  for 
each  time  instant.  We  then  extracted  a  time  curve  for  the  bulk  ICG  concentrations.  Figure  7  shows  the  resulting 
bulk  ICG  concentration  time  curves  for  each  case.  Next,  we  tted  the  two-compartment  model  to  the  resulting  time 
curves  and  estimated  the  bulk  pharmacokinetic -rates.  Table  5  tabulates  the  bulk  pharmacokinetic-rates  for  each 
patient.  In  Figure  20  and  2 1 ,  the  bulk  pharmacokinetic-rates  are  overlaid  on  the  histograms  of  the  pharmacokinetic- 
rate  images  obtained  from  inside  and  outside  the  tumor  region.  Figures  20  and  21  show  the  Bayesian  minimum 
error  classi  er  threshold  (the  value  corresponding  to  the  intersection  of  the  histograms)  for  each  case.  We  see  that 
for  Case  1,  the  bulk  rates  of  k{n,  and  kovi  are  both  classi  ed  as  healthy  tissue  (outside  the  tumor  region).  For  Case 
2,  kin  is  classi  ed  as  cancerous  tissue  (inside  the  tumor  region)  and  kout  is  classi  ed  as  healthy  tissue.  Similarly  for 
Case  3,  kin  is  classi  ed  as  healthy  and  kout  is  classi  ed  as  cancerous  tissue.  Clearly,  the  bulk  rates  do  not  provide 
consistent  diagnosis.  On  the  other  hand,  spatially  resolved  pharmacokinetic-rates  allow  us  to  compare  values  from 
the  cancerous  region  to  the  healthy  background,  and  provide  higher  sensitivity,  and  speci  city  as  shown  by  the 
Bayesian  total  error  values  in  Table  IV. 
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Fig.  1.  Block  diagram  of  the  two-compartment  model  for  ICG  pharniacokinetics. 
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TABLE  I 

EKF  ALGORITHM  FOR  SIMULTANEOUS  ESTIMATION  OF  STATES  AND  PARAMETERS. 


Initial  Conditions 


Cd(0) 

0(0) 


E(  Cd(0)) 
0(0) 


Po,o  = 


V  ar(Cd(0)) 
0 


0 

sd 


State  Estimate  Propagation 


Cd(fc|fc-1) 

Kd(0(fc-l))Cd(fc-l) 

0{k\k-  1) 

0(k  -  1) 

Error  Covariance  Propagation 


Pfc,fc  — 1  —  Jk  —  lPfc  —  l,fc  —  1  Jk—1  "E 


Qd 

0 


0 

Sd 


State  Estimate  Update 


Cd(fc) 

Cd(fc|fc-1) 

6(k\k-l) 

+Gfe(m(fc)  -  Xd(9(k\k  -  l))Cd(k\k  -  1)) 


Error  Covariance  Update 
Kalman  Gain 

Definitions 


P  k,k  —  [I  Hfc  Ais^_i]Pk,k-l 

Hfc  =  Pfc^-iAfe|fc_1[Afc|fc_1Pfc,fc_iAfc|fc_1  +Rd]_1 

\d(0(k\k  —  1)) 
0 


Jk  = 


K  4m)  ^[Kd(0(k))Cd(k)} 

0  I 


TABLE  II 

Tumor  information  for  each  patient 


Tumor  Type 

Tumor  Size 

Tumor  Location 

Case  1 

Fibroadenoma 

1-2  cm 

6-7  o'clock 

Case  2 

Adenocarcinoma 

2-3  cm 

4-5  o'clock 

Case  3 

Invasive  Ductal  Carcinoma 

4  by  3  cm 

6  o'clock 

TABLE  III 

Mean  and  standard  deviation  of  pharmacokinetic-rates  for  inside  and  outside  the  tumor  region 


kin  ( sec 

-Her2) 

kout  (s6C 

_1kt2) 

kelm  (s6C 

^KT3) 

Inside 

Outside 

Inside 

Outside 

Inside 

Outside 

Case  1 

2.14±0.018 

0.73±0.011 

1.24±0.069 

0.43±0.013 

4.11±0.057 

3.87±0.012 

Case  2 

2.92±0.076 

1.14±0.052 

1.58±0.051 

0.65±0.036 

3.94±0.081 

4.12±0.047 

Case  3 

6.87±0.093 

3.06±0.015 

4.96±0.048 

1.66±0.072 

4.49±0.056 

4.46±0.081 
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TABLE  IV 

Total  Error  (probability  of  false  positive  and  false  negative)  in  tumor  diagnosis  Based  on  Bayesian  minimum 

ERROR  CLASSIFIER 


Total  Error  for  kin 

Total  Error  for  kout 

Case  1 

6.67  x  10“2 

13.34  x  10"2 

Case  2 

3.23  x  10“2 

13.51  x  10"2 

Case  3 

9.09x  10-2 

3.64  x  10~2 

TABLE  V 

Bulk  Pharmacokinetic-rates  extracted  from  the  entire  breast  tissue 


kin  ( sec  110  2) 

kout  ( sec  110  2) 

fcelm  (S6C  10  ) 

Case  1 

0.84±0.013 

0.62±0.017 

3.66±0.042 

Case  2 

2.01±0.022 

0.83±0.012 

4.01±0.054 

Case  3 

4.06±0.072 

3.36±0.051 

4.37±0.052 

Fig.  2. 


The  cut  section  of  the  CW  NIR  imaging  apparatus  with  16  sources  and  detectors. 
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Fig.  3.  Spatially  resolved  bulk  ICG  concentration  images  for  a  set  of  time  instants  for  Case  1. 
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Fig.  4.  Spatially  resolved  bulk  ICG  concentration  images  for  a  set  of  time  instants  for  Case  2. 
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Fig.  5. 


Spatially  resolved  bulk  ICG  concentration  images  for  a  set  of  time  instants  for  Case  3. 
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Fig.  6.  Time  course  of  ICG  concentration  curves  for  a  speci  c  voxel,  65th,  216th,  188t/l  voxel  for  Case  1,  Case  2,  and  Case  3,  respectively. 


Fig.  7.  Averaged  time  course  of  ICG  concentration  curves  for  Case  1,  Case  2,  and  Case  3. 


(a)  (b) 

Fig.  8.  Spatially  resolved  pharmacokinetic-rate  images,  (a)  kin,  and  (b)  kout  for  Case  1. 
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Fig.  9.  Spatially  resolved  pharmacokinetic-rate  images,  (a)  ki„,  and  (b)  kout  for  Case  2. 


Fig.  16.  Spatially  resolved  ICG  concentration  images  in  the  EES  for  Case  3  for  the  246.4th,  378.4th,  and  510.4th  seconds. 
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Fig.  17.  The  kin  images  with  approximate  tumor  location  and  size  for  Case  1,  Case  2,  and  Case  3,  respectively.  An  ellipse  or  a  circle 
identi  es  the  approximate  location,  shape,  and  size  of  the  tumor. 


(c) 

Fig.  18.  The  histograms  of  kin  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  from  inside  (blue)  and  outside  (gray)  the  tumor  region  (as  indicated 
by  circular/elliptical  regions).  The  solid  lines  in  gures  shows  the  Gaussian  t. 
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Fig.  19.  The  histograms  of  kout  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  from  inside  (blue)  and  outside  (gray)  the  tumor  region  (as  indicated 
by  circular/elliptical  regions).  The  solid  lines  in  gures  shows  the  Gaussian  t. 
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(a)  (b) 


(c) 

Fig.  20.  Bulk  kin  rates  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  together  with  histogram  ts.  The  solid  blue  line  indicates  the  bulk 


pharmacokinetic-rate,  and  the  dashed  red  line  indicates  the  Bayesian  minimum  error. 
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(a)  (b) 


(c) 

Fig.  21.  Bulk  kout  rates  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  together  with  histogram  ts.  The  solid  blue  line  indicates  the  bulk 


pharmacokinetic -rate,  and  the  dashed  red  line  indicates  the  Bayesian  minimum  error. 
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B.  AIM  3  -  Tasks  1-4 

The  SOW  with  regard  to  Aim  3  includes  the  following  speci  c  tasks: 

•  Task  1.  Determine  statistical  variability  of  each  NIR  feature  inside  and  outside  the  suspected  tumor  in  an 
individual  and  evaluate  the  statistical  signi  cance  of  the  measured  difference  with  the  instrumentation  precision. 
12-18th  month 

•  Task  2.  Design  statistical  classi  ers  to  determine  the  ROC  of  each  NIR  feature  for  an  individual.  18-24th 
month 

•  Task  3.  Evaluate  the  ROC,  positive  predicted  value  (PPV)  and  negative  predicted  value  (NPV)  of  various 
combinations  of  the  NIR  features  for  an  individual.  24-27th  month 

•  Task  4.  Investigate  the  signi  cance  of  the  measured  difference  between  malignant  and  benign  tumor  patient 
groups  for  single  and  combined  NIR  features.  27-30th  month 

Year  3  tasks  involve  Task  3  and  4. 

This  work  describes  the  characterization  ef  cienc  y  of  optical  properties  of  breast  tumors  based  on  the  features 
obtained  using  in  vivo  near-infrared  (NIR)  spectroscopy  measurements.  Three  features,  relative  blood  concentration, 
oxygen  saturation  and  the  size  of  the  tumor,  are  used  to  diagnose  benign  and  malignant  tumors.  The  performance  of 
the  proposed  set  of  features  are  evaluated  by  various  classi  ers  using  data  acquired  from  44  patients  with  malignant 
tumors,  and  72  patients  with  benign  tumors.  The  area  under  the  receiver  operating  characteristics  (ROC)  curve  of 
the  scaled  nearest  mean  classi  er  (NMSC)  using  the  three  features  yields  a  value  of  0.91  with  a  signi  cance  level 
of  0.05.  Our  results  suggest  that  the  features,  relative  blood  concentration,  and  oxygen  saturation  can  differentiate 
breast  tumors  with  a  relatively  high  precision. 

1)  Methods:  Apparatus :  In  this  study,  a  continuous  wave  (CW)  near  infrared  spectrometer  (NIRS)  is  used  [?]. 
The  apparatus  includes  a  probe  (Fig.  22).  In  the  center  of  the  probe  there  is  a  3-wavelength  light  emitting  diode 
(LED).  The  probe  consisted  of  one  multi-wavelength  LED  as  a  light  source  and  8  silicon  diodes  as  detectors. 
The  detectors  surround  the  LED  with  a  4  cm  radius.  The  light  intensity  from  the  detectors  was  adjusted  to  be 
approximately  1  volt  and  calibrated  with  a  phantom  with  known  absorption  and  scattering  coef  cients. 

Patients  and  Protocol :  This  study  includes  two  centers,  namely,  the  Abramson  Family  Cancer  Research  Institute, 
Department  of  Radiology  of  the  Hospital  of  University  of  Pennsylvania  (HUP),  and  the  Department  of  Gynecology 
of  Leipzig  University  (DGLU).  HUP  provided  24  patients  with  malignant  and  64  patients  benign  tumors.  DGLU 
provided  20  patients  with  malignant  and  6  benign  tumors. 


26 


Fig.  22.  The  NIR  probe  with  a  multi-wavelength  LED  and  8  silicon  diodes  as  detectors. 

TABLE  VI 

AUC  values  FOR  different  classifiers  FOR  F1-F2-F3:  A BV,  ADeoxy,  S 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9098 

0.9041 

0.9017 

0.8984 

0.8864 

0.8843 

0.8807 

0.8752 

The  measurements  are  taken  on  the  breast  with  tumor.  Then,  the  probe  is  transferred  to  the  tumor  free  contralateral 
breast  to  include  the  mirror  image  location  of  the  suspected  cancer.  The  sensors  giving  the  largest  changes  with 
respect  to  the  mirror  image  position  on  the  contralateral  breast  are  related  to  the  suspected  cancer.  Feature  Extraction : 
In  this  study,  three  features,  namely,  relative  blood  concentration,  ABV,  oxygen  saturation,  ADeoxy,  and  the  size 
of  the  tumor,  S,  are  used. 

The  features,  ABV,  and  ADeoxy  are  obtained  using 

AOD  =  eACL  (14) 

where  OD  is  the  optical  density,  e  is  the  extinction  coef  cient,  C  is  blood  concentration,  L  is  the  mean  pathlength 
of  photons,  and  A  denotes  relative  change.  Here,  e  ~  1  cm”1,  and  L  =  4  cm  for  a  pathlength  factor  of  5. 

Following  (1),  the  relative  blood  concentration,  ABV,  and  the  oxygen  saturation,  ADeoxy,  can  be  approximated 
at  two  different  wavelengths  by 

ABV  oc  O.3AOD730  +  AOD850  (15) 

ADeoxy  oc  I.3AOD730  +  AOD^^q  (16) 

where  AOD730,  and  AO D^  denote  the  relative  changes  in  optical  density  at  730  nm  and  850  nm,  respectively. 
ABV  and  ADeoxy  can  also  be  approximated  by 


ABV  oc  A [Hb\  +  A  [Hb02\ 


(17) 
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TABLE  VII 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F1-F2:  A BV ,  ADeoxy 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9001 

0.8993 

0.8930 

0.8908 

0.8992 

0.8821 

0.8782 

0.8645 

A Deoxy  oc  A [Hb02]  -  A [Hb]  (18) 

where  A  [Hb],  and  A  [HbO-fi  denote  the  relative  change  in  deoxyhemoglobin  (Hb)  and  oxyhemoglobin  (Hb02). 
The  concentrations  of  Hb,  and  Hb02  in  (17)  and  (18)  are  calculated  by  the  Beer-Lambert  Law  given  by 

A  OD  =  logj  (19) 

where  I  is  light  intensity  after  absorption  and  scattering,  and  Iq  is  the  baseline  light  intensity  obtained  from  the 
contralateral  breast,  using  known  extinction  coefcients  of  Hb,  Hb02  and  differential  pathlength  factors  [16]. 

Here,  it  is  important  to  note  that,  A  BV,  and  ADeoxy  values  are  based  on  a  lipid  blood  oxygen  model.  Thus 
the  increments  of  BV  and  Deoxy  are  relative  to  the  contralateral  breast: 

A  BV  =  A  BVtumor  —  ABVcontra  (20) 

ADeoxy  =  A  DeoxytUmor  ~  A  Deoxy  contra  (21) 

where  A  BVtumor,  ABVcontra  are  relative  blood  volume  in  the  tumor  breast  and  the  mirror  image  position  of  the 
contralateral  breast,  respectively,  and  ADeoxy  tumor,  A  Deoxy  contra  are  relative  oxygen  saturation  in  the  tumor 
breast  and  the  mirror  image  position  of  the  contralateral  breast,  respectively. 

Feature  Analysis  and  Tumor  Classification :  In  this  subsection,  we  present  the  set  of  tumor  classi  cation  features, 
and  the  malignancy  differentiation  criteria.  FI  denotes  A  BV,  F2  denotes  ADeoxy,  and  F3  denotes,  S,  size  of  the 
tumor.  We  evaluate  the  malignancy  differentiation  capability  of  the  individual  features  and  various  combinations 
of  the  these  features  using  a  set  of  classi  ers,  namely,  k-nearest  neighbor  classi  er  (KNNC),  Parzen  density  based 
classi  er  (PAR),  automatic  neural  network  classi  er  (NEURC),  normal  densities  based  linear  classi  er  (LDC), 
nearest  mean  classi  er  (NMC),  scaled  nearest  mean  classi  er  (NMSC),  normal  densities  based  quadratic  classi  er 
(QDC),  uncorrelated  normal  densities  based  quadratic  classi  er  (UDC).  The  more  details  information  on  these 
classi  ers  can  be  found  in  [17], 

We  evaluated  the  malignancy  differentiation  capability  of  the  following  individual  and  combined  features: 


FI:  A  BV 
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F2  :  A  Deoxy 
F3  :  Tumor  Size  (S') 

F1-F2:  A  BV  and  A  Deoxy 
F1-F2-F3:  A  BV,  ADeoxy,  and  S 

2)  Statistical  Analysis  of  Clinical  Data:  The  evaluation  is  based  on  receiver  operating  characteristics  (ROC) 
methodology.  The  ROC  curve  is  obtained  by  plotting  the  probability  of  false  positive  rate  versus  the  probability 
of  detection.  The  evaluation  of  classi  cation  method  is  done  using  area  under  the  ROC  curve  (AUC).  First,  we 


Fig.  23.  Scaled  Nearest  Mean  Classier  and  F1-F2  2-D  data  clustering. 

evaluated  the  classi  cation  performance  of  all  three  features.  Table  VI  presents  the  AUC  values  for  8  different 
classi  ers  for  all  three  features.  The  NMSC  has  the  best  performance  in  terms  of  classi  cation  with  a  AUC  value 
of  0.9098  followed  by  the  Parzen  classi  er  with  a  AUC  value  of  0.9041. 

Next,  we  evaluated  the  performance  of  the  two  features  measured  by  NIR  spectroscopy.  Table  VII  presents  the 
AUC  values  for  8  different  classi  ers  for  features  A  BV  and  A  Deoxy.  Again,  the  NMSC  performed  the  best  in 
terms  of  classi  cation  with  a  AUC  value  of  0.9001.  Finally,  we  evaluated  the  individual  classi  cation  performances 
of  the  three  features.  Table  VIII  presents  the  AUC  values  for  8  different  classi  ers  for  the  feature  A  BV.  The  NMC 
has  the  best  performance  in  terms  of  classi  cation  with  a  AUC  value  of  0.8832.  Table  IX  presents  the  results  8 
different  classi  ers  for  the  feature  A  Deoxy.  The  NMC  has  the  best  performance  in  terms  of  classi  cation  with  a 
AUC  value  of  0.879.  Table  X  presents  the  results  8  different  classi  ers  for  the  feature  S.  The  QDC  has  the  best 
performance  in  terms  of  classi  cation  with  a  AUC  value  of  0.5612. 
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Fig.  24.  Parzen  and  F1-F2  2-D  data  clustering. 


TABLE  VIII 


AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  FI :  A BV 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8817 

0.8764 

0.8807 

0.8779 

0.8513 

0.8778 

0.8832 

0.8302 

As  it  can  be  seen  from  Tables  VI,  and  VII,  the  best  performing  feature  set  is  the  combination  of  the  three 
features.  We  can  also  conclude  from  Table  X  that,  the  tumor  size  can  not  be  used  to  differentiate  healthy  and 
diseased  tissues  with  an  AUC  value  of  around  0.5.  However,  the  combination  set  of  optical  indices,  obtained 
using  optical  measurements,  can  differentiate  breast  tumors  with  a  relatively  high  precision  with  a  AUC  value  of 
0.9.  Similarly,  optical  indices,  A  BV  and  A  Deoxy,  also  performed  well  with  AUC  values  of  0.883  and  0.879, 
respectively. 

Figures  23,  and  24  show  the  distribution  of  features  A  BV,  and  A  Deoxy  extracted  from  benign  and  malignant 
tumors.  The  thresholds  were  computed  using  scaled  nearest  mean  classi  er  and  Parzen  classi  ers.  Figure  25  presents, 
the  ROC  curves  for  all  three  features,  and  the  best  two  features,  namely,  A  Deoxy  and  A  BV.  The  observed  area 
under  the  ROC  curve  for  F1-F2-F3,  and  F1-F2  are  0.9098,  and  0.9001,  respectively.  Figure  26  presents  the  ROC 
curves  for  individual  features  FI,  and  F2  using  the  nearest  mean  classi  er  .  The  observed  area  under  the  ROC  curve 
for  FI,  and  F2  are  0.8832  and  0.8790,  respectively. 
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TABLE  IX 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F2:  A Deoxy 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8787 

0.8764 

0.8776 

0.8711 

0.8491 

0.8613 

0.8790 

0.8331 

TABLE  X 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F3:  S 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.5123 

0.5292 

0.4782 

0.5429 

0.5382 

0.5612 

0.5112 

0.4827 

Fig.  25.  ROC  curves  for  F1-F2-F3  and  F1-F2  using  NMSC  Classi  er . 


Fig.  26.  ROC  curves  for  Fland  F2  using  NMC  Classi  er  . 
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III.  Key  Research  Accomplishments 

1)  We  developed  a  method  of  analyzing  the  discretization  error  in  diffuse  optical  image  reconstruction  and 
developed  a  novel  adaptive  mesh  generation  algorithm  based  on  MR  a  priori  anatomical  images.  We  applied 
our  method  to  diffuse  optical  absorption  imaging.  The  results  of  this  work  is  published  in  two  journal  articles 
in  Inverse  Problems  (8), (9). 

2)  We  developed  an  adaptive  mesh  generation  algorithms  for  diffuse  optical  imaging  for  simultaneous  recon¬ 
struction  of  diffusion  and  absorption  images.  This  work  is  currently  being  prepared  as  a  journal  paper. 

3)  We  demonstrated,  for  the  rst  time  in  the  literature,  the  value  of  spatially  resolved  pharmacokinetic-rates  as 
opposed  to  bulk-rates  using  in  vivo  breast  cancer  patient  data.  This  work  is  submitted  as  a  journal  paper  to 
IEEE  Transactions  in  Biomedical  Engineering  (1). 

4)  We  developed  methods  of  reconstructing  pharmacokinetic-rate  images  directly  from  NIR  boundary  measure¬ 
ments  and  applied  our  technique  to  in  vivo  breast  cancer  data.  The  results  show  that  the  technique  is  robust 
and  provides  better  signal-to-noise  ratio  images.  We  are  in  the  process  of  assessing  the  implications  of  this 
results  in  breast  cancer  diagnosis  and  staging. 

5)  We  further  analyze  the  NIR  parameters  collected  from  116  patients  using  NIR  spectroscopy  and  reported  a 
short  version  of  our  results  in  (2).  This  work  is  currently  being  prepared  as  a  journal  paper. 

IV.  Reportable  Outcomes 

Complete  list  of  outcomes  is  given  below: 

1)  B.  Alacam,  B.  Yazici,  X.  Intes,  B.  Chance,  S.  Nioka,  “Pharmacokinetic-Rate  Images  of  Indocyanine  Green  for 
Breast  Tumors  using  Near  Infrared  Optical  Methods,”  in  review  IEEE  Transaction  in  Biomedical  Engineering, 
May  2007. 

2)  B.  Alacam,  B.  Yazici.  B.  Chance,  S.  Nioka,  “Characterization  of  Breast  Tumors  with  NIR  Methods  using 
Optical  Indices,”  to  be  published  in  29th  IEEE  EMBS  Conference,  2007. 

3)  B.  Alacam,  B.  Yazici  X.  Intes,  B.  Chance,  “Extended  Kalman  ltering  for  the  modeling  and  analysis  of 
ICG  pharmacokinetics  in  cancerous  tumors  using  NIR  optical  methods,”  IEEE  Transactions  in  Biomedical 
Engineering,  Vol.  53-10,  pp:  1861-1871,  2006. 

4)  B.  Alacam,  B.  Yazici,  X.  Intes,  S.  Nioka,  B.  Chance,  “Spatially  resolved  pharmacokinetic  rate  images  of  ICG 
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using  Near  Infrared  optical  methods”,  Proc.  of  2006  SPIE  Photonic  West,  San  Jose,  California  USA,  21-26 
January  2006,  vol.  6088.  pp.  455-464. 

5)  B.  Alacam,  B.  Yazici,  X.  Intes,  S.  Nioka,  B.  Chance,  “Direct  reconstruction  of  pharmacokinetic  rate  Images 
of  Indocyanine  Green  in  uorescence  molecular  tomography”  Proceedings  of  Biomedical  Optics  Topical 
Meeting,  Fort  Lauderdale,  Florida,  USA  ,  19-22  March,  2006,  No.  SH65. 

6)  B.  Alacam,  B.  Yazici,  A.  Serdaroglu,  X.  Intes,  B.  Chance,  “Reconstruction  of  spatially  resolved  pharmacoki¬ 
netic  rate  images  of  uorescence  agents  in  FDOT”  Proc.  of  EMBS-28th  Anniversary  Conference,  September, 
2006,  New  York,  pp.  5627  5630. 

7)  B.  Alacam,  B.  Yazici,  X.  Intes,  B.  Chance,  ’’Direct  reconstruction  of  spatially  resolved  pharmacokinetic  rate 
images  of  uorescence  agents”  to  be  submitted  to  IEEE  Transactions  in  Medical  Imaging. 

8)  M.  Guven,  B.  Yazici,  K.  Kwon,  E.  Giladi,  X.  Intes,  Effect  of  discretization  error  and  adaptive  mesh  generation 
in  diffuse  optical  absorption  imaging:  Part  II,”  Inverse  Problems,  Vol.  23,  pp:  1135-1160,  May  2007. 

9)  M.  Guven,  B.  Yazici,  K.  Kwon,  E.  Giladi,  X.  Intes,  Effect  of  discretization  error  and  adaptive  mesh  generation 
in  diffuse  optical  absorption  imaging:  Part  I,”  Inverse  Problems,  Vol.  23,  pp:  1115-1133,  May  2007. 

10)  K.  Kwon,  B.  Yazici,  and  M.  Guven  Two-level  domain  decomposition  methods  for  diffuse  optical  tomography, 
Inverse  Problems,  Vol.  22,  pp:  1533-1559,  2006. 

11)  K.  Kwon  and  B.  Yazici,  Bom  expansion  and  Frechet  derivatives  in  diffuse  optical  tomography,  submitted  to 
Inverse  Problems,  May  2006. 

12)  M.  Guven,  B.  Yazici,  K.  Kwon,  E.  Giladi,  and  X.  Intes,  Adaptive  mesh  generation  for  DOT  to  reduce  the 
error  resulting  from  discretization”,  IEEE  Symposium  in  Biomedical  Imaging,  April  2007.  invited. 

13)  M.  Guven,  B.  Yazici  and  V.  Ntziachristos,  Fluorescence  diffuse  optical  Image  reconstruction  with  a  priori 
information,  Proceedings  of  SPIE  Photonics  West  2007.  Invited. 

V.  Conclusions 

In  the  last  12  months,  we  continued  to  analyze  the  116  breast  cancer  patient  data  obtained  using  NIR  spectroscopy 
and  re  ne  our  imaging  algorithms.  We  showed  that  excluding  the  pharmacokinetic  rate  images,  the  NIR  parameters 
alone  can  provide  an  AUC  value  of  0.9  or  higher.  Additionally,  we  analyzed  the  three  in  vivo  breast  cancer  patient 
data  acquired  using  a  dynamic  time-resolved  tomographic  system.  The  patients  were  injected  ICG  (an  optical 
contrast  agent)  and  diffuse  optical  tomographic  data  was  collected  before  and  after  the  injection.  We  reconstructed 
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the  spatially  resolved  pharmacokinetic-rates  of  ICG  using  variety  of  techniques  and  analyzed  the  resulting  images 
in  terms  of  diagnostic  value.  Our  analysis  shows  that  the  spatially-resolved  pharmacokinetic  rates  provide  superior 
diagnostic  information  in  terms  of  speci  city  and  sensitivity  as  compared  to  a  single  set  of  pharmacokinetic-rates 
obtained  using  spectroscopic  techniques. 

We  continued  to  re  ne  our  NIR  imaging  algorithms.  We  published  a  novel  adaptive  meshing  algorithms  for  diffuse 
optical  absorption  imaging  using  the  MR  a  priori  information.  We  extended  our  algorithms  for  simultaneous  recon¬ 
struction  of  diffusion  and  absorption  images.  We  also  developed  methods  of  pharmacokinetic-rate  images  directly 
from  NIR  boundary  measurements  which  provide  robust  reconstruction  with  higher  SNR  than  the  pharmacokinetic- 
rate  images  obtained  from  absorption  images.  We  are  currently  assessing  the  value  of  this  technique  in  breast  cancer 
diagnosis  in  terms  of  sensitivity  and  speci  city  . 
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Abstract 


In  this  paper,  we  develop  a  method  of  forming  pharmacokinetic -rate  images  of  indocyanine  green 
(ICG)  and  apply  our  method  to  three  sets  of  in  vivo  data  obtained  from  breast  cancer  patients.  To  form 
pharmacokinetic -rate  images,  we  first  obtain  a  sequence  of  ICG  concentration  images  using  the  differential 
diffuse  optical  tomography  technique.  We  next  employ  a  two-compartment  model  composed  of  plasma, 
and  extracellular-extravascular  space  (EES),  and  estimate  the  pharmacokinetic -rates  and  concentrations  in 
each  compartment  using  the  extended  Kalman  filtering  framework.  The  pharmacokinetic-rate  images  of 
the  three  patient  show  that  the  rates  from  inside  and  outside  the  tumor  region  are  statistically  different  with 
very  low  probability  of  false  negative  and  false  positive.  Additionally,  the  ICG  concentrations  in  plasma, 
and  the  EES  compartments  are  higher  around  the  tumor  region  tumor  agreeing  with  the  hypothesis  that 
around  the  tumor  region  ICG  may  act  as  a  diffusible  extravascular  flow  in  compromised  capillary  of 
cancer  vessels.  Our  study  shows  that  the  pharmacokinetic -rate  images  provide  superior  information  than 
single  set  of  pharmacokinetic-rates  estimated  from  the  entire  breast  tissue  for  breast  cancer  diagnosis. 

Keywords:  Pharmacokinetic  analysis,  indocyanine  green,  differential  diffuse  optical  tomography,  breast  cancer. 
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I.  INTRODUCTION 


Near  infrared  (NIR)  diffuse  optical  imaging  offers  several  advantages  over  other  imaging  modalities 
[1] — [7].  NIR  techniques  are  minimally  invasive,  easy  to  use,  relatively  inexpensive,  and  can  be  made 
portable.  Moreover,  optical  techniques,  when  coupled  with  contrast  agents,  have  the  potential  to  pro¬ 
vide  molecular/cellular  level  information,  which  can  improve  cancer  detection,  staging,  and  treatment 
monitoring  [3],  [4],  [8]— [10]. 

Among  many  commercially  available  optical  contrast  agents,  only  indocyanine  green  (ICG)  is  approved 
for  use  in  humans  by  the  Food  and  Drug  Administration  [1 1] — [13].  ICG  is  a  blood  pooling  agent  that 
has  different  delivery  behavior  between  normal  and  cancer  vasculature.  In  normal  tissue,  ICG  acts  as 
a  blood  flow  indicator  in  tight  capillaries  of  normal  vessel.  However  in  cancerous  tumor,  ICG  may 
act  as  a  diffusible  (extravascular)  flow  in  leaky  capillary  of  cancer  vessels  [8],  [10],  [14].  Therefore, 
pharmacokinetics  of  ICG  has  the  potential  to  provide  new  tools  for  tumor  detection,  diagnosis,  and 
staging. 

One  approach  to  analyze  pharmacokinetics  of  contrast  agents  is  the  compartmental  modeling  [15]— 

[17] .  A  number  of  studies  using  compartmental  modeling  were  reported  to  show  the  feasibility  of  ICG 
pharmacokinetics  in  tumor  characterization  [8]— [10].  Cuccia  et  al.  [8]  presented  a  study  of  the  dynamics 
of  ICG  in  an  adenocarcinoma  rat  tumor  model  using  a  two-compartment  model.  Intes  et  al.  [9]  presented 
the  uptake  of  ICG  in  breast  tumors  using  a  continuous  wave  diffuse  optical  tomography  apparatus  using 
a  two-compartment  model.  We  recently  introduced  the  extended  Kalman  filtering  (EKF)  framework  to 
model  and  estimate  the  ICG  pharmacokinetics  and  tested  three  different  compartmental  models  for  the 
ICG  pharmacokinetics  using  the  in  vivo  NIR  data  collected  from  Fischer  rats  with  cancerous  tumors  [10]. 
Our  study  suggests  that  the  pharmacokinetic-rates  out  of  the  vasculature  are  higher  in  edematous  tumors 
as  compared  to  necrotic  tumors. 

In  all  the  studies  described  above,  the  pharmacokinetic-rates  are  assumed  to  be  constant  over  a  tissue 
volume  that  may  be  as  large  as  the  entire  imaging  domain.  However,  pharmacokinetic-rates  are  expected 
to  be  different  in  healthy  and  cancerous  tissue  as  reported  in  positron  emission  tomography  (PET),  and 
magnetic  resonance  imaging  (MRI)  literature.  In  [18]-[20],  it  was  shown  that  the  spatially  resolved 
pharmacokinetic-rates  increase  sensitivity  and  specificity  for  breast  cancer  diagnosis.  For  example,  in 

[18] ,  Sun  et  al.  showed  that  FAU  (l-2’-deoxy-2’-fluoro-  (3  -  D  -  a  rab  inf  u  ran  o  sy  1  urasil,  a  PET  contrast  agent) 
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accumulation  in  tumor  region  is  significantly  higher  when  compared  to  normal  breast  tissue  based  on 
pharmacokinetic-rate  images.  Mussurakis  et  al.  [19]  showed  that  the  pharmacokinetics  of  gadolinium- 
DTPA  (an  MRI  contrast  agent)  can  be  used  to  differentiate  between  malignant  and  benign  breast  tumors 
with  a  high  accuracy.  It  has  also  been  shown  that  the  spatially  resolved  image  interpretation  is  superior 
to  the  isolated  use  of  quantitative  pharmacokinetic -rates.  Since  gadolinium-DTPA  kinetics  is  expected  to 
be  similar  to  ICG,  [19],  and  [20]  suggest  that  the  spatially  resolved  ICG  pharmacokinetic  analysis  may 
provide  better  sensitivity  and  specificity. 

In  the  area  of  diffuse  NIR  spectroscopy  and  imaging,  a  number  of  studies  on  spatially  resolved 
pharmacokinetic-rates  has  been  reported  [21],  [22],  Gurfinkel  et  al.  [21]  presented  in  vivo  NIR  reflectance 
images  of  ICG  pharmacokinetics  to  discriminate  canine  adenocarcinoma  (located  at  0.5-1  cm  depth) 
from  normal  mammary  tissue.  These  images  were  generated  by  a  non-tomographic  technique  using  a 
CCD  camera  that  is  suitable  only  to  image  tumors  close  to  surface.  In  [22],  Milstein  et  al.  presented  a 
Bayesian  tomographic  image  reconstruction  method  to  form  pharmacokinetic -rate  images  of  ICG  based 
on  fluorescence  diffuse  optical  tomography.  Numerical  simulations  show  that  the  method  provides  good 
contrast.  However,  no  real  data  experiments  were  presented  to  study  the  clinical  value  of  spatially  resolved 
pharmacokinetic-rates. 

In  this  paper,  we  present  a  method  of  forming  pharmacokinetic-rate  images  and  report  spatially  resolved 
pharmacokinetic-rates  of  ICG  using  the  in  vivo  NIR  data  acquired  from  three  breast  cancer  patients. 
To  the  best  of  our  knowledge,  our  work  is  the  first  presenting  the  pharmacokinetic -rate  images  of  an 
optical  contrast  agent  using  in  vivo  breast  data  based  on  tomographic  techniques.  We  first  develop  a 
set  of  spatio-temporally  resolved  ICG  concentration  images  based  on  differential  diffuse  optical  tomog¬ 
raphy.  We  model  the  ICG  pharmacokinetics  by  a  two-compartment  model  composed  of  plasma  and 
extracellular-extravascular  space  (EES)  compartments.  We  then  estimate  the  ICG  pharmacokinetic-rates 
and  the  concentrations  in  different  compartments  based  on  the  EKF  framework  [10].  We  show  that  the 
pharmacokinetic-rates  from  inside  and  outside  the  tumor  region  are  statistically  different.  Total  error 
values  using  the  Bayesian  minimum  error  classifier  show  that  spatially  resolved  pharmacokinetic-rate 
analysis  provides  very  good  sensitivity  and  specificity.  We  also  estimate  a  single  set  of  pharmacokinetic- 
rates  (bulk  pharmacokinetic-rates)  for  the  entire  breast  tissue.  Our  study  indicates  that  spatially  resolved 
pharmacokinetic-rates  provides  more  consistent  and  superior  diagnostic  information  as  compared  to  the 
bulk  pharmacokinetic-rates. 
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The  rest  of  the  paper  is  organized  as  follows:  In  Section  II,  we  present  the  reconstruction  of  ICG 
concentration  images.  In  Section  III,  we  present  modeling  and  estimation  of  ICG  pharmacokinetics  using 
the  EKF  framework.  In  Section  IV,  we  describe  the  clinical  results  obtained  from  breast  cancer  patients. 
Section  V  summarizes  our  results  and  conclusion. 

II.  Reconstruction  of  Bulk  ICG  Concentration  Images 

In  our  data  collection  process,  a  sequence  of  boundary  measurements  are  collected  over  a  period 
of  time.  Each  set  of  measurements  are  used  to  form  a  frame  of  the  ICG  concentration  images.  The 
resulting  sequence  of  ICG  concentration  images  are  then  used  to  form  pharmacokinetic -rate  images.  To 
reconstruct  each  frame  of  the  ICG  concentration  images,  we  follow  a  static  reconstruction  approach  and 
use  differential  diffuse  optical  tomography  (DDOT)  technique  [9],  [23]. 

In  DDOT,  two  sets  of  excitation  measurements  are  collected  corresponding  to  before  and  after  the 
ICG  injection,  and  the  ICG  concentration  is  determined  by  the  perturbation  method  [9],  [23].  The  photon 
propagation  before  and  after  the  injection  is  modeled  by  the  following  diffusion  equation: 

V -[DxV^(r,uj) -[^x(r)  + ju;/c]^(r,uj)  =  0,  r  €  D  CM3  (1) 

with  Robin- type  boundary  conditions: 

2DX-^-  +  =  -S(r,w),  r  <E  <9D.  (2) 

where  x  stands  for  the  excitation,  c  is  the  speed  of  light  inside  the  medium  Q;  ui  denotes  the  modulation 
frequency  of  the  source,  /r~x(r)  and  /jV,(r)  are  the  absoiption  coefficients  before  and  after  the  ICG 
injection,  Dx  is  the  diffusion  coefficient  which  is  assumed  to  be  constant,  <3?x(r,  ui)  optical  field  at 
location  r  before  and  after  the  ICG  injection.  Here,  v  denotes  the  outward  normal  to  the  boundary  Oil 
of  fl,  p  is  a  constant  representing  the  refractive  index  mismatch  between  the  two  regions  separated  by 
dQ,  and  Sir.  uj)  is  the  excitation  source  on  the  boundary. 

The  absorption  coefficient  after  the  injection  are  modeled  as  a  sum  of  the  absorption  coefficient 
of  the  medium  before  the  ICG  injection  fx~x  and  the  perturbation  caused  by  the  ICG  Afiax(r)\ 

A Hax(r)  =  ntx(.r)  -  n~x(r),  r  G  D  C  M3.  (3) 

In  the  forward  model,  the  analytical  solutions  of  the  heterogonous  diffusion  equation  given  in  (1)  is 
derived  using  first  order  Rytov  approximation  [9].  The  sample  volume  is  divided  into  a  set  of  voxels  and 
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the  measurements  are  related  to  the  relative  absorption  coefficients  of  each  voxel  by  a  system  of  linear 
equations.  The  shape  of  the  breast  was  approximated  as  a  cylinder  and  the  Kirchhoff  approximation  [24], 
[25]  for  diffuse  waves  was  used  to  model  the  interaction  of  light  with  boundaries.  In  order  to  account 
for  the  biological  noise,  the  forward  model  was  implemented  with  coupling  coefficient  technique  [26]. 

Here,  the  Rytov-type  measurements,  which  are  defined  by  the  natural  logarithm  of  the  ratio  of  the 
post-ICG  measurements  to  the  pre-ICG  measurements  were  used  [23].  Let  ^x(rd,  oj\  rs)  denote  the 
Rytov-type  measurements  at  location  rd  due  to  source  at  rs.  The  linearized  relationship  between  the 
differential  absoiption  coefficient  and  measurements  is  given  by  [27], 

'S'x(rd,u;rs)  =  -  - -[  G~{r  -  rd,  u;  rs)Ix{r)$~{r,  u>;  rs)d3r  (4) 

(rd,u;rs)  Jn 

where  4>“(r,  cu;  rs)  is  the  photon  density  obtained  at  the  excitation  wavelength  before  ICG  injection, 
Ix(r)  =  cApax(r)  /  Dx,  and  G~(r  —  rd,  tu;  rs)  is  the  Green’s  function  of  (1)  for  a  source  at  rs  before  the 
injection  describing  the  propagation  of  light  from  the  heterogeneity  r  to  the  detector  at  rd. 

We  address  the  inverse  problem  of  recovering  A fiax  from  Rytov-measurements  based  on  the 
forward  model  (4)  using  the  singular  value  decomposition  of  the  Moore-Penrose  generalized  system.  We 
use  a  zeroth-order  Tikhonov  regularization  to  stabilize  the  inversion  procedure.  We  apply  the  L-curve 
method  to  an  experimental  model  reconstruction  and  derive  the  best  regularization  parameter  using  a 
curvature  function  as  described  in  [28].  A  detailed  discussion  of  the  forward  and  inverse  models  used 
for  the  reconstruction  of  differential  absorption  coefficients  (A pax)  can  be  found  in  [9]. 

To  construct  a  set  of  ICG  concentration  images,  we  use  the  linear  relationship  between  the  differential 
absorption  coefficients  and  ICG  concentrations  [29]: 

A na{r)  =  In  10  e\  m(r)  =  2.303  e\  m(r)  (5) 

where  e\  is  the  extinction  coefficient  of  ICG  at  the  wavelength  805nm,  m(r)  is  the  bulk  ICG  concentration 
in  the  tissue,  and  Apa{r)  is  as  defined  in  (3). 

III.  Modeling  and  Estimation  of  ICG  Pharmacokinetics 
A.  Two-compartment  Model  of  ICG  Pharmacokinetics 

Using  the  method  outlined  in  Section  2,  we  reconstruct  a  sequence  of  ICG  concentration  images.  As 
an  example,  Figure  3  shows  a  set  of  images  reconstructed  from  in  vivo  breast  data. 
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Our  objective  is  to  model  the  pharmacokinetics  of  ICG  at  each  voxel  of  ICG  concentration  images 
using  compartmental  modeling.  To  do  so,  we  first  extracted  the  time  varying  ICG  concentration  curves 
for  each  voxel  from  the  sequence  of  ICG  concentration  images.  An  example  of  such  a  curve  is  shown 
in  Figure  6.  We  next  fit  a  two-compartment  model  to  each  ICG  concentration  curve  [10],  [21].  Figure  1 
shows  the  two-compartment  model  for  ICG  kinetics.  Flere,  plasma  and  extracellular-extravascular  space 
(EES)  constitute  the  two  compartments.  Cp  and  Ce  represent  the  ICG  concentrations  in  plasma  and  the 
EES,  respectively.  The  rates  kin  and  kout  govern  the  leakage  into  and  the  drainage  out  of  the  EES.  The 
parameter  keim  describes  the  ICG  elimination  from  the  body  through  kidneys  and  livers. 

Using  the  two-compartment  model  introduced  in  [10],  ICG  transition  between  plasma  and  the  EES 
can  be  modeled  as  follows: 


Ce(t) 

kout  kin 

Ce(t) 

cP(t )  _ 

kout  ( kin  keim ) 

_  cP(t )  _ 

(6) 


where  Cp(t)  and  Ce(t )  represent  the  ICG  concentrations  in  plasma  and  the  EES  at  time  t  e  [T0,Ti], 
respectively.  The  rates  kin,  kout,  and  keim  have  a  unit  of  sec-1.  They  are  defined  as  the  permeability 
surface  area  products  given  by  PSp,  where  P  is  the  capillary  permeability  constant,  S  is  the  capillary 
surface  area,  and  p  is  the  tissue  density.  Here,  u(t)  is  uncorrelated  zero-mean  Gaussian  process  with 
covariance  matrix  Q  representing  the  model  mismatch. 

The  actual  total  ICG  concentration  in  the  tissue  is  a  linear  combination  of  plasma  and  the  EES  ICG 
concentrations  modeled  as: 


m{t) 


Ve  Vp 

Ce(t) 

Cp{t) 

+  v(t) 


(V) 


where  m{t),  Ce{t),  and  Cp(t)  are  defined  in  (5)  and  (6);  vp  and  ve  are  plasma  and  the  EES  volume 
fractions,  respectively;  and  77(f)  is  uncorrelated  zero-mean  Gaussian  process  with  covariance  matrix  R, 
representing  the  measurement  noise. 


B.  Estimation  of  ICG  Pharmacokinetics  using  Extended  Kalman  Filtering 
In  matrix-vector  notation,  (6)  and  (7)  can  be  expressed  as: 


C(i)  =  K(a)C(f)  +  w(f), 
m{t)  =  V  (a)C(f)  +  77(f), 


(8) 
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where  C (t)  denotes  the  concentration  vector  with  elements  Ce(t ),  and  Cp(t)\  K(a)  is  the  system  matrix, 
V(a)  is  the  measurement  matrix  as  defined  in  equation  (7),  and  a  is  the  parameter  vector  given  by 

&  —  [kout  kin  keim  ve  Vp\  .  (9) 

The  ICG  measurements  in  (8)  are  collected  at  discrete  time  instances,  t  =  kT,  k  =  0,1, where  T 
is  the  sampling  period.  Therefore,  the  continuous  model  described  in  (8)  is  discretized.  We  can  express 
the  discrete  compartmental  model  as  follows: 


Cd(k  +  l)  =  Kd{G)Cd(k)  +  ud(k), 


(10) 


m  (k)  =  \d(0)C  d(k)  +  Vd(k), 


where  K.d(6)  =  eK-°7  is  the  discrete  time  system  matrix;  \ ,i(0)  =  V(a)  is  the  discrete  measurement 
matrix;  c cd(k)  and  r]d(k)  are  zero-mean  Gaussian  white  noise  processes  with  covariances  matrix  Qd 
and  variance  Rd,  respectively.  The  vector  6  is  composed  of  parameters  which  are  functions  the 
pharmacokinetic-rates  and  volume  fractions: 


e  = 


-i  T 


Til  T12  r2l  T2  2  Ve 


(11) 


We  first  estimate  r^’s,  i.  j  =  1,2  and  then  compute  the  pharmacokinetic-rates  kin,  kout  and  keim  [10], 
[30].  The  explicit  form  of  the  discrete  state-space  model  is  given  as  follows: 


Ce(k  +  1) 

Til  T12 

ce(k) 

Cp(k+  1)  _ 

1 

.s1 

to 

1 _ 

_  cp(k)  _ 

+  ud{k) 


(12) 


m(k) 


Ve  Vp 

ce(k) 

_  cp(k )  _ 

+  Vd(k). 


We  estimate  the  parameter  vector  6  and  concentration  vector  Cd  by  using  the  EKF  framework.  The 
EKF  is  a  recursive  modeling  and  estimation  method  with  numerous  advantages  in  ICG  pharmacokinetic 
modeling  [10].  These  include  effective  modeling  of  multiple  compartments,  and  multiple  measurement 
systems  in  the  presence  of  measurement  noise  and  uncertainties  in  the  compartmental  model  dynamics, 
simultaneous  estimation  of  model  parameters  and  ICG  concentrations  in  each  compartment,  statistical 
validation  of  estimated  concentrations  and  error  bounds  on  the  model  parameter  estimates,  and  incor¬ 
poration  of  available  a  priori  information  about  the  initial  conditions  of  the  permeability  rates  into  the 
estimation  procedure. 


When  both  states  (ICG  concentrations)  and  model  parameters  (pharmacokinetic-rates  and  volume 
fractions)  are  unknown,  a  linear  state-space  model  can  be  regarded  as  a  non-linear  model;  the  linear 
system  parameters  and  states  combine  to  form  the  new  states  of  the  non-linear  model.  This  system  is 
then  linearized  and  the  new  unknown  states  are  found  using  the  EKF  estimator  [10],  [3 1]— [33].  In  EKF 
framework,  6  can  be  treated  as  a  random  process  with  the  following  model: 

O(k  +  l)  =  0(k)  +  sd{k),  (13) 

where  Sd(k)  is  a  zero-mean  Gaussian  process  with  covariance  matrix  S,/. 

Table  1  summarizes  the  joint  estimation  of  pharmacokinetic-rates  and  ICG  concentration  in  different 
compartments.  In  Table  1,  Cd(k\k  —  1)  is  the  state  estimate  propagation  at  step  k  given  all  the  measure¬ 
ments  up  to  step  k  —  1;  C-d(k)  is  the  state  estimate  update  at  step  k\  i  denotes  the  error  covariance 
propagation  at  step  k  given  all  the  measurements  up  to  step  k  —  1;  P/../,:  is  the  error  covariance  update 
at  step  A:;  S d  is  the  preassigned  covariance  matrix  of  (/,:);  J /,  is  the  Jacobian  matrix  due  to  iterative 
linearization  of  the  state  equation  at  step  A:;  G /,.  is  the  recursive  Kalman  gain  at  step  k\  R,/  is  the  covariance 
matrix  of  the  measurements;  Qd  is  the  covariance  matrix  of  the  concentration  vector;  and  I  is  the  identity 
matrix.  A  detailed  discussion  of  the  Kalman  filtering  algorithm  for  the  joint  estimation  of  the  model 
parameters  and  ICG  concentrations  in  different  compartments  can  be  found  in  [10]. 

IV.  Clinical  Results 

A.  Apparatus 

In  this  work,  we  use  the  data  collected  with  a  continuous  wave  (CW)  NIR  imaging  apparatus.  The 
apparatus  has  16  light  sources,  which  are  tungsten  bulbs  with  less  than  1  watt  of  output  power.  They  are 
located  on  a  circular  holder  at  an  equal  distance  from  each  other  with  22.5  degrees  apart.  Sixteen  detectors, 
namely,  silicon  photodiodes,  are  situated  in  the  same  plane.  The  breast  is  arranged  in  a  pendular  geometry 
with  the  source-detector  probes  gently  touching  its  surface.  Figure  2  illustrates  the  configuration  of  the 
apparatus  and  the  configuration  of  the  detectors  and  the  sources  in  a  circular  plane.  Note  that  sources 
and  detectors  are  co-located.  The  detectors  use  the  same  positions  as  the  sources  to  collect  the  light 
originating  from  one  source  at  a  time.  Only  the  signals  from  the  farthest  11  detectors  are  used  in  the 
analysis.  For  example,  when  Source  1  is  on,  the  data  is  collected  using  Detectors  4  to  14.  A  band  pass 
filter  at  805  nm,  the  absoiption  peak  of  ICG,  is  placed  in  front  of  the  sources  to  select  the  desired 
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wavelength.  A  set  of  data  for  one  source  is  collected  every  ~  500  ms.  The  total  time  for  a  whole  scan 
of  the  breast  including  16  sources  and  16  detectors  is  ~  8.8  seconds.  A  more  detailed  explanation  of  the 
apparatus  and  the  data  collection  procedure  can  be  found  in  [34]. 

B.  Protocol 

Patients  with  suspicious  breast  tumors  are  enrolled  for  this  study.  ICG  is  injected  intravenously  by  bolus 
with  a  concentration  of  0.25  mg  per  kg  of  body  weight.  Diagnostic  information  is  obtained  using  biopsy 
results.  Since  biopsy  modifies  the  blood  volume  and  blood  flow  around  the  tumor  region,  measurements 
are  made  prior  to  biopsy.  Data  acquisition  started  before  the  injection  of  ICG  and  continued  for  10 
minutes. 

C.  Tumor  Information 

Three  different  patients  with  different  tumor  types  are  included  in  this  study.  First  case,  (Case  1),  is 
fibroadenoma,  which  corresponds  to  a  mass  estimated  to  be  1—2  cm  in  diameter  within  a  breast  of  9 
cm  diameter  located  at  6-7  o’clock.  Second  case,  (Case  2),  is  adenocarcinoma  corresponding  to  a  tumor 
estimated  to  be  2—3  cm  in  diameter  within  a  breast  of  7.7  cm  diameter  located  at  4-5  o’clock.  The  third 
case,  (Case  3),  is  invasive  ductal  carcinoma,  which  corresponds  to  a  mass  estimated  to  be  4  by  3  cm 
located  at  6  o’clock.  Table  2  describes  the  tumor  information  for  each  patient. 

D.  ICG  Concentration  Measurements  for  Pharmacokinetic  Parameter  Estimations 

Using  the  CW  imager  described  above,  sufficient  number  of  source-detector  readings  are  collected 
from  different  angles  for  each  patient.  Differential  absorption  images  are  reconstructed  based  on  DDOT 
forward  model  given  in  (4).  Using  the  linear  relationship  (5)  between  ICG  concentration,  and  absorption 
coefficient,  ICG  concentration  images  are  obtained  for  each  case.  A  sample  set  of  ICG  concentration 
images  for  the  selected  time  instants  are  shown  in  Figure  3,  4,  and  5  for  Case  1,  2,  and  3,  respectively. 
Although  only  9  images  are  displayed,  there  are  around  50  images  for  each  case,  each  corresponding  to 
a  different  time  instant.  Each  image  is  composed  of  649  voxels.  Note  that  the  ICG  concentration  images 
in  Figures  3,  4,  and  5  represent  the  bulk  ICG  concentrations  in  the  tissue,  not  the  ICG  concentrations  in 
plasma  or  in  the  EES  compartments. 
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We  next  extract  the  time  course  of  ICG  concentration  for  each  voxel.  As  an  example,  Figure  6  shows 
the  time  course  of  ICG  concentrations  for  all  three  cases  for  a  specific  voxel  around  the  tumor  region 
(65th,  216th ,  1 8 8 ; voxel  for  Case  1,  Case  2,  and  Case  3,  respectively).  We  then  fit  the  two-compartment 
model  to  each  time  course  data  using  the  EKF  framework;  and  estimate  kin,  kout,  keim,  and  the  ICG 
concentrations  in  plasma  and  EES.  Spatially  resolved  images  of  kin,  and  kout  for  each  case  are  shown 
in  Figures  8(a)-(b),  and  9(a)-(b),  10(a)-(b),  respectively.  Additionally,  we  construct  ICG  concentration 
images  for  plasma  and  the  EES  compartments.  Figures  11-16  show  the  ICG  concentration  in  plasma 
and  the  EES  for  3  different  time  instants  for  Case  1,  2,  and  3,  respectively.  Our  results  show  that 
the  pharmacokinetic -rates  are  higher  around  the  tumor  region  agreeing  with  the  fact  that  permeability 
increases  around  the  tumor  region  due  to  compromised  capillaries  of  cancer  vessels.  We  also  observed 
that  ICG  concentrations  in  plasma  and  the  EES  compartments  are  higher  around  the  tumors  agreeing 
with  the  hypothesis  that  around  the  tumor  region  ICG  may  act  as  a  diffusible  extravascular  flow  in  leaky 
capillary  of  cancer  vessels. 

Using  the  brightest  spot  as  a  reference  point,  the  a  priori  information  on  the  location,  and  the  size  of 
the  tumors,  we  plot  an  ellipse  (or  a  circle)  to  identify  the  approximate  location  and  size  of  the  tumor 
in  the  pharmacokinetic-rate  images.  Figures  17(a),  (b),  and  (c)  present  the  kin  images  with  approximate 
tumor  location  and  size  for  Case  1,  2,  and  3,  respectively.  The  consistency  of  the  bright  regions  in 
the  kin  images,  and  circular/elliptical  regions  drawn  based  on  the  a  priori  information  shows  that  the 
pharmacokinetic-rate  images  provide  good  localization  of  tumors  and  size  information. 

The  histograms  of  kin  and  kout  rates  from  inside  and  outside  the  tumor  region  (as  indicated  by 
circular/elliptical  regions)  are  shown  in  Figures  18(a),  (b),  and  (c),  and  Figures  19(a),  (b),  and  (c), 
respectively.  The  solid  curves  in  Figures  18  and  19  show  the  Gaussian  fit.  Clearly,  pharmacokinetic- 
rate  images  facilitates  analysis  of  the  spatial  variation  of  the  pharmacokinetic -rates  for  breast  cancer 
screening,  diagnosis,  and  staging.  The  histograms  and  their  Gaussian  probability  density  function  fits  in 
Figures  18  and  19  show  that  the  mean  and  the  standard  deviation  of  kin  and  kout  values  are  different 
inside  and  outside  the  tumor  region.  Table  3  tabulates  the  mean  values  (±  spatial  standard  deviation)  of 
the  pharmacokinetic -rates  from  inside  and  outside  the  tumor  regions  for  all  three  cases.  The  total  error 
(probability  of  false  positive  and  false  negative)  using  the  Bayesian  minimum  error  classifier  is  given  by 
the  overlapping  area  between  the  two  Gaussian  fits  [35].  Table  4  tabulates  the  total  Bayesian  error.  These 
small  error  values  show  that  the  spatially  resolved  pharmacokinetics-rate  analysis  provide  good  sensitivity 
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and  specificity  for  diagnosis.  One  other  advantage  of  the  pharmacokinetic-rate  images  is  that  they  allow 
comparison  of  rates  obtained  from  healthy  and  tumor  tissue  for  the  patient  itself  since  absolute  (mean) 
values  may  be  quite  different  from  patient  to  patient.  For  example,  the  kin  rate  for  Case  3  obtained 
outside  the  tumor  region  is  higher  than  the  kj/n  rate  for  Case  1  obtained  inside  the  tumor.  Thus,  using  the 
pharmacokinetic-rates  obtained  from  healthy  tissue  as  a  reference  to  the  tumor  tissue  for  the  same  patient 
can  improve  diagnostic  accuracy.  However,  very  high  mean  values  of  kin  and  kout  may  be  indicative  of 
the  severity  of  malignancy.  The  pharmacokinetic  rates  are  higher  for  Case  3  (invasive  ductal  carcinoma), 
for  both  inside  and  outside  the  tumor  region  as  compared  to  Case  2  (adenocarcinoma).  Similarly,  the 
kinetic  rates  are  higher  for  Case  2  (adenocarcinoma),  as  compared  to  Case  1  (fibroadenoma)  for  both 
inside  and  outside  the  tumor  region. 

To  further  understand  the  value  of  spatially  resolved  pharmacokinetic-rate  analysis  as  compared  to 
the  bulk  pharmacokinetic-rates,  we  averaged  the  concentration  images  spatially,  and  obtained  a  bulk 
concentration  value  for  each  time  instant.  We  then  extracted  a  time  curve  for  the  bulk  ICG  concentrations. 
Figure  7  shows  the  resulting  bulk  ICG  concentration  time  curves  for  each  case.  Next,  we  fitted  the  two- 
compartment  model  to  the  resulting  time  curves  and  estimated  the  bulk  pharmacokinetic -rates.  Table  5 
tabulates  the  bulk  pharmacokinetic -rates  for  each  patient.  In  Figure  20  and  21,  the  bulk  pharmacokinetic- 
rates  are  overlaid  on  the  histograms  of  the  pharmacokinetic-rate  images  obtained  from  inside  and  outside 
the  tumor  region.  Figures  20  and  21  show  the  Bayesian  minimum  error  classifier  threshold  (the  value 
corresponding  to  the  intersection  of  the  histograms)  for  each  case.  We  see  that  for  Case  1,  the  bulk 
rates  of  kin,  and  kout  are  both  classified  as  healthy  tissue  (outside  the  tumor  region).  For  Case  2,  kvn  is 
classified  as  cancerous  tissue  (inside  the  tumor  region)  and  kout  is  classified  as  healthy  tissue.  Similarly 
for  Case  3,  kin  is  classified  as  healthy  and  kout  is  classified  as  cancerous  tissue.  Clearly,  the  bulk  rates 
do  not  provide  consistent  diagnosis.  On  the  other  hand,  spatially  resolved  pharmacokinetic -rates  allow  us 
to  compare  values  from  the  cancerous  region  to  the  healthy  background,  and  provide  higher  sensitivity, 
and  specificity  as  shown  by  the  Bayesian  total  error  values  in  Table  IV. 

V.  Conclusion 

In  this  study,  we  presented  a  method  of  forming  pharmacokinetic-rate  images  and  reported  spatially 
resolved  pharmacokinetic-rates  of  ICG  for  three  breast  cancer  patients.  To  form  pharmacokinetic-rate 
images,  we  first  obtained  a  sequence  of  ICG  concentration  images  using  the  differential  diffuse  optical  to- 
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mographic  technique.  We  next  employed  the  two-compartment  model,  and  estimated  the  pharmacokinetic- 
rates  and  concentrations  in  each  compartment  for  each  voxel  using  the  EKF  framework.  We  have  shown 
in  our  prior  work  [10]  that  the  EKF  framework  has  a  number  of  advantages  in  pharmacokinetic-rate 
estimation,  some  of  which  include  robust  estimation  in  the  presence  of  measurement  noise  and  dynamic 
model  uncertainties. 

The  reconstruction  of  the  pharmacokinetic-rate  images  presented  in  this  work  were  obtained  in  separate, 
decoupled  steps.  The  quantitative  accuracy  of  pharmacokinetic-rate  images  can  be  improved  by  coupling 
the  two  reconstruction  steps  which  can  better  capture  the  temporal  and  spatial  variations  in  the  ICG 
concentrations  and  pharmacokinetic-rates  and  provide  robust  reconstruction  by  eliminating  outliers.  Such 
a  direct  reconstruction  technique  is  currently  being  developed  in  our  laboratory  and  will  be  reported  in 
the  near  future. 

We  formed  the  pharmacokinetic-rate  images  using  the  in  vivo  data  obtained  from  three  breast  can¬ 
cer  patients  and  studied  the  value  of  spatially  resolved  pharmacokinetic-rates.  A  clear  advantage  of 
pharmacokinetic-rate  images  is  that  they  allow  comparison  of  the  features  obtained  form  the  tumor  region 
to  the  ones  obtained  from  the  healthy  background  for  the  same  patient.  This  within  patient  comparison  is 
important  because,  pharmacokinetic-rates  for  healthy  tissue  may  be  significantly  different  from  patient  to 
patient.  Furthermore,  spatially  resolved  images  allow  us  to  utilize  not  only  the  average  rates  but  also  other 
features  related  to  the  geometry  of  the  heterogeneity,  and  higher  order  statistics  of  the  pharmacokinetic- 
rates.  The  total  error  for  the  Bayesian  minimum  error  classifier  for  all  three  patients  indicates  that 
the  pharmacokinetic-rate  images  can  provide  high  sensitivity  and  specificity  for  tumor  diagnosis.  The 
comparison  of  the  bulk  pharmacokinetic-rates  to  Bayesian  minimum  error  classifier  threshold  shows  that 
the  diagnostic  information  provided  by  the  bulk  pharmacokinetic-rates  is  not  consistent.  Our  study  shows 
that  the  spatially  resolved  pharmacokinetic -rate  analysis  is  superior  to  the  bulk  pharmacokinetic-rates 
in  tumor  diagnosis.  Our  study  also  indicates  that  similar  results  may  also  hold  for  tumor  staging  and 
treatment  monitoring. 


13 


References 


[1]  X.  Intes,  B.  Chance,  “Non-PET  functional  imaging  techniques:  Optical,”  Radio.  Clin,  of  North  America,  vol.  43-1,  pp. 
221-234,  2005. 

[2]  A.  Yodh  and  B.  Chance,  “Spectroscopy  and  imaging  with  diffusing  light,”  Phys.  Today,  vol.  48-3,  pp.  34-40,  1995. 

[3]  E.  M.  Sevick-Muraca,  G.  Lopez,  T.  L.  Troy,  J.  S.  Reynolds,  and  C.  L.  Hutchinson,  “Fluorescence  and  absorption  contrast 
mechanisms  for  biomedical  optical  imaging  using  frequency-domain  techniques,  Photochem.  Photobiol.,  vol.  66,  pp.  55, 
1997. 

[4]  U.  Mahmood,  C.H.  Tung,  A.  Bogdanov  Jr.,  and  R.  Weissleder,“Near  infrared  optical  imaging  of  protease  activity  for  tumor 
detection,”  Radiology,  vol.  213-3,  pp.  866-870,  1999. 

[5]  S.R.  Arridge,  “Optical  tomography  in  medical  imaging,”  Inverse  Probl.  vol.  15,  pp.  41-93,  1999. 

[6]  S.  A.  Boppart,  W.  Luo,  D.  L.  Marks,  K.  W.  Singletary,  “Optical  coherence  tomography:  feasibility  for  basic  research  and 
image-guided  surgery  of  breast  cancer,”  Breast  Cancer  Res  Treat.,  vol.  84-2,  pp.  85-97,  2004. 

[7]  X.  Gu,  Q.  Zhang,  L.  Larcom,  and  H.  Jiang,  “Three-dimensional  bioluminescence  tomography  with  model-based  reconstruc¬ 
tion,”  Optics  Express,  vol.  12,  Issue  17,  pp.  3996-4000,  2004. 

[8]  D.J.  Cuccia,  F.  Bevilacqua,  A.  J.  Durkin,  S.  Merritt,  B.  J.  Tromberg,  G.  Gulsen,  H.  Yu,  J.  Wang,  and  O.  Nalcioglu,  “In 
vivo  quantification  of  optical  contrast  agent  dynamics  in  rat  tumors  by  use  of  diffuse  optical  spectroscopy  with  magnetic 
resonance  imaging  coregistration,”  Applied  Optics,  vol.  42-1,  pp.  2940-2950,  2003. 

[9]  X.  Intes,  J.  Ripoll,  Y.  Chen,  S.  Nioka,  A.  G.  Yodh,  B.  Chance,  “ In  vivo  continuous-wave  optical  breast  imaging  enhanced 
with  Indocyanine  Green,”  Med.  Phys.  vol.  30-6,  pp.  1039-1047,  2003. 

[10]  B.  Alacarn,  B.  Yazici,  X.  Intes,  B.  Chance,  “Extended  Kalman  Filtering  for  the  Modeling  and  Analysis  of  ICG 
Pharmacokinetics  in  Cancerous  Tumors  using  NIR  Optical  Methods”,  Transactions  in  IEEE  Biomedical  Engineering,  vol. 
53,  no.  10,  pp.  1861-1871,  2006. 

[11]  D.  Hansen,  A.  Spence,  T.  Carski,  and  M.  Berger,  “Indocyanine  green  (ICG)  staining  and  demarcation  of  tumor  margins 
in  a  rat  glioma  model,”  Surg.  Neurol.,  vol.  40,  pp.  451-456,  1993. 

[12]  H.  Shinohara,  A.  Tanaka,  T.  Kitai,  N.  Yanabu,  T.  Inomoto,  S.  Satoh,  Hatano,  Y.  Yamaoka,  and  K.  Hirao,  ’’Direct 
measurement  of  hepatic  Indocyanine  Green  clearance  with  near-infrared  spectroscopy:  separate  evaluation  of  uptake  and 
removal,”  Hepatology,  vol.  23,  pp.  137-144,  1996. 

[13]  A.  ElDeosky,  A.  Seifalian,  M.  Cope,  D.  Delpy,  and  B.  Davidson,  “Experimental  study  of  liver  dysfunction  evaluated  by 
direct  Indocyanine  green  clearance  using  near  infrared  spectroscopy,”  Br.  J.  Surg.,  vol.  86,  pp.  1005-1011.  1999. 

[14]  P.  Vaupel,  K.  Schlenger,  C.  Rnoop,  M.  Hockel,  “Oxygenation  of  human  tumors:  evaluation  of  tissue  oxygen  distribution 
in  breast  cancers  by  computerized  O 2  tension  measurements,”  Cancer  Res.,  vol.  51,  pp.  33163322,  1991. 

[15]  C.  W.  Tornoe,  “Grey-Box  PK/PD  Modeling  of  Insulin,”  M.S.  Thesis,  June  28,  2002. 

[16]  D.  H.  Anderson,  Lecture  Notes  in  Biomathematics:  Compartmental  Modeling  and  Tracer  Kinetics,  Springer- Verlag,  Berlin, 
1983. 

[17]  J.  A.  Jacquez,  Compartmental  Analysis  in  Biology  and  Medicine:  Kinetics  of  Distribution  of  Tracer-labeled  Materials, 
Elsevier  Pub.  Co.,  New  York,  1972. 


14 


[18]  H.  Sun,  J.  M.  Collins,  T.  J.  Mangner,  O.  Muzik,  A.  Shields,  “Imaging  the  pharmacokinetics  of  [F-18]FAU  in  patients  with 
tumors:  PET  studies,”  Cancer  Chemother.  Pharmacol ,  vol.  57-3,  pp.  343-348,  2006. 

[19]  S.  Mussurakis,  D.  L.  Buckley,  P.  J.  Drew,  J.  N.  Fox,  P.  J.  Carleton,  L.  W.  Turnbull,  A.  Horsman,  “Dynamic  MR  imaging  of 
the  breast  combined  with  analysis  of  contrast  agent  kinetics  in  the  differentiation  of  primary  breast  tumours”  Clin  Radiol, 
vol.  52-7,  pp.  516-26,  1997. 

[20]  M.  Y.  Su,  H.  J.  Yu,  P.  M.  Carpenter,  C.  E.  McLaren,  O.  Nalcioglu,  “Pharmacokinetic  parameters  analyzed  from  mr  contrast 
enhancement  kinetics  of  multiple  malignant  and  benign  breast  lesions  detected  in  the  same  patients,”  Technol  Cancer  Res 
Treat.,  vol.  4-3,  pp.  255-63,  2005. 

[21]  M.  Gurfinkel,  A.  B.  Thompson,  W.  Ralston,  T.  L.  Troy,  A.  L.  Moore,  T.  A.  Moore,  J.  D.  Gust,  D.  Tatman,  J.  S.  Reynolds, 
B.  Muggenburg,  K.  Nikula,  R.  Pandey,  R.  H.  Mayer,  D.  J.  Hawrysz,  and  E.  M.  Sevick-Muraca,  “Pharmacokinetics  of  ICG 
and  HPPH-car  for  the  detection  of  normal  and  tumor  tissue  using  fluorescence,  near-infrared  reflectance  imaging:  a  case 
study,”  Photochem.  Photobiol.,  Vol.  72,  pp.  94-102,  2000. 

[22]  A.  B.  Milstein,  K.  J.  Webb,  and  C.  A.  Bouman,  “Estimation  of  kinetic  model  parameters  in  fluorescence  optical  diffusion 
tomography,”  J.  Opt.  Soc.  Am.,  vol.  22-7,  pp.  1357-1368. 

[23]  V.  Ntziachristos,  B.  Chance,  and  A.  Yodh, “Differential  diffuse  optical  tomography,”  Opt.  Express,  vol.  5,  pp.  230-242, 
1999. 

[24]  J.  Ripoll,  V.  Ntziachristos,  R.  Carminati,  and  M.  Nieto- Vesperinas,  “Kirchhoff  approximation  for  diffusive  waves,”  Phys. 
Rev.  vol.  64,  pp.  051917,  2001. 

[25]  J.  Ripoll,  V.  Ntziachristos,  J.  P.  Culver,  D.  N.  Pattanayak,  A.  G.  Yodh,  and  M.  Nieto- Vesperinas,  “Recovery  of  optical 
parameters  in  multiplelayered  diffusive  media:  theory  and  experiments,”  J.  Opt.  Soc.  Am.  vol.  A  18,  pp.  821830,  2001. 

[26]  D.  Boas,  T.  Gaudette,  and  S.  Arridge,  “Simultaneous  imaging  and  optode  calibration  with  diffuse  optical  tomography,” 
Opt.  Express  vol.  8,  pp.  263270,  2001. 

[27]  M.  O’Leary, ‘Imaging  with  Diffuse  Photon  Density  Waves,”  in  PhD  Thesis,  Dept.  Physics  &  Astronomy,  U.  of  Pennsylvania, 
May  1996. 

[28]  P.  Hansen  and  D.  OLeary,  “The  use  of  the  L-curve  in  the  regularization  of  discrete  ill-posed  problems,”  SIAM  J.  Sci. 
Comput.,  vol.  14,  pp.  1487  1503,  1993. 

[29]  M.  L.  J.  Landsman,  G.  Kwant,  G.  A.  Mook,  W.  G.  Zijlstra, “Light-  absorbing  properties,  stability,  and  spectral  stabilization 
of  indocyanine  green,”  J.  Appl.  Physiol.,  vol.  40,  pp.  575-583,  1976. 

[30]  C.  Chen,  Linear  System  Theory  and  Design,  Oxford  University  Press,  New  York,  1999. 

[31]  L.  Ljung,  “Asymptotic  Behavior  of  the  Extended  Kalman  Filter  as  a  Parameter  Estimator  for  Linear  Systems,”  IEEE  Tran. 
Automa.  Control,  vol.  AC-24,  no.  1,  pp.  36-50,  1979. 

[32]  R.  Togneri,  and  L.  Deng,  “Joint  State  and  Parameter  Estimation  for  a  Target-Directed  Nonlinear  Dynamic  System  Model,” 
IEEE  Tran,  on  Sig.  Proc.,  vol.  51,  no.  12,  pp.  3061-3070,  2003. 

[33]  L.  Nelson,  and  E.  Stear,  “The  Simultaneous  On-Line  Estimation  of  Parameters  and  States  in  Linear  Systems,”  IEEE  Tran, 
on  Auto.  Control,  vol.21,  pp.  94-98,  1976. 

[34]  S.  Nioka,  Y.  Yung,  M.  Schnall,  S.  Zhao,  S.  Orel,  C.  Xie,  B.  Chance,  and  S.  Solin, “Optical  imaging  of  breast  tumor  by 
means  of  continuous  waves,”  Adv.  Exp.  Med.  Biol,  vol.  411,  pp.  227-232,  1997. 

[35]  K.  Fukunaga,  Introduction  to  Statistical  Pattern  Recognition,  Academic  Press,  1990. 


15 


TABLE  I 


EKF  ALGORITHM  FOR  SIMULTANEOUS  ESTIMATION  OF  STATES  AND  PARAMETERS. 


Initial  Conditions 


Cd(0) 

'e(  Cd(0)) 

Var(Cd(0))  0 

0(0) 

0(0) 

,  Po.o  = 

0  Sd 

State  Estimate  Propagation 


Cd(fc|fc-1) 
0(k\k-  1) 


Kd{9(k  -  l))Cd(fc  -  1) 
0(k-  1) 


Error  Covariance  Propagation 

State  Estimate  Update 


Error  Covariance  Update 
Kalman  Gain 

Definitions 


Pfc,fc-1  —  Jfc-lPfc-l,fc-lJfc-l  + 


Cd(k) 

Cd(klk-l) 

0{k\k-l) 

+Gfe(m(fc)  -  Yd{0{k\k  -  l))Cd(k\k  -  1)) 
P  k,k  —  [I  Hfc  Afc|/c_i]Pfcife_i 


Hfe  =  Pfc,fe-iAfc|fc_1[Afc|fc_1Pfc,fc_iAfc|fc_1  +  Rd]  1 


Jfc 

K  dim)  ^[K  d(0(k))Cd(k)] 

Afc|fc_i  — 

\d(9{k\k  -  1)) 

0  I 

0 
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TABLE  II 


Tumor  information  for  each  patient 


Tumor  Type 

Tumor  Size 

Tumor  Location 

Case  1 

Fibroadenoma 

1-2  cm 

6-7  o'clock 

Case  2 

Adenocarcinoma 

2-3  cm 

4-5  o'clock 

Case  3 

Invasive  Ductal  Carcinoma 

4  by  3  cm 

6  o'clock 

TABLE  III 

Mean  and  standard  deviation  of  pharmacokinetic-rates  for  inside  and  outside  the  tumor  region 


kin  (see 

-1  icr2) 

kout  (see 

-Ho-2) 

kelm  (see 

“HO"3) 

Inside 

Outside 

Inside 

Outside 

Inside 

Outside 

Case  1 

2.14±0.018 

0.73±0.011 

1.24±0.069 

0.43±0.013 

4.11±0.057 

3.87±0.012 

Case  2 

2.92±0.076 

1.14±0.052 

1.58±0.051 

0.65±0.036 

3.94±0.081 

4.12±0.047 

Case  3 

6.87±0.093 

3.06±0.015 

4.96±0.048 

1.66±0.072 

4.49±0.056 

4.46±0.081 

TABLE  IV 

Total  Error  (probability  of  false  positive  and  false  negative)  in  tumor  diagnosis  Based  on  Bayesian 

minimum  error  classifier 


Total  Error  for  kin 

Total  Error  for  kout 

Case  1 

6.67  x  10"2 

13.34  x  10-2 

Case  2 

3.23  x  10~2 

13.51  x  10~2 

Case  3 

9.09  x  10"2 

3.64  x  10“2 

TABLE  V 

Bulk  Pharmacokinetic-rates  extracted  from  the  entire  breast  tissue 


kirl  ( sec  ^0  2) 

kout  ( sec  HO  2) 

keim  ( sec  HO 

Case  1 

0.84±0.013 

0.62±0.017 

3.66±0.042 

Case  2 

2.01±0.022 

0.83±0.012 

4.01±0.054 

Case  3 

4.06±0.072 

3.36±0.051 

4.37±0.052 
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Fig.  5. 
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Figure  Captions 

Fig.  1.  Block  diagram  of  the  two-compartment  model  for  ICG  pharmacokinetics. 

Fig.  2.  (Left)  schematic  diagram,  (Right)  the  cut  section  of  the  CW  NIR  imaging  apparatus  with  16 
sources  and  detectors. 

Fig.  3.  ICG  concentration  images  for  a  set  of  time  instants  for  Case  1. 

Fig.  4.  ICG  concentration  images  for  a  set  of  time  instants  for  Case  2. 

Fig.  5.  ICG  concentration  images  for  a  set  of  time  instants  for  Case  3. 

Fig.  6.  Time  course  of  ICG  concentration  curves  for  a  specific  voxel,  65th,  216th,  1 88th  voxel  for  Case 

1,  Case  2,  and  Case  3,  respectively. 

Fig.  7.  Averaged  time  course  of  ICG  concentration  curves  for  Case  1,  Case  2,  and  Case  3,  respectively. 
Fig.  8.  Pharmacokinetic-rate  images,  (a)  kin,  and  (b)  kout  for  Case  1. 

Fig.  9.  Pharmacokinetic-rate  images,  (a)  ktn,  and  (b)  kout  for  Case  2. 

Fig.  10.  Pharmacokinetic -rate  images,  (a)  kin,  and  (b)  kout  for  Case  3. 

Fig.  11.  ICG  concentration  images  in  plasma  for  Case  1  for  (a)  246.4th,  (b)  334.4th,  and  (c)  422.4th 
seconds. 

Fig.  12.  ICG  concentration  images  in  the  EES  for  Case  1  for  (a)  246.4th,  (b)  334.4th,  and  (c)  422.4th 
seconds. 

Fig.  13.  ICG  concentration  images  in  plasma  for  Case  2  for  (a)  228.8th,  (b)  316.8th,  and  (c)  404.8th 
seconds. 

Fig.  14.  ICG  concentration  images  in  the  EES  for  Case  2  for  (a)  228.8th,  (b)  316.8th,  and  (c)  404.8th 
seconds. 

Fig.  15.  ICG  concentration  images  in  the  plasma  for  Case  3  for  (a)  246.4th,  (b)  378.4th,  and  (c)  510.4th 
seconds. 

Fig.  16.  ICG  concentration  images  in  the  EES  for  Case  3  for  (a)  246.4th,  (b)  378.4th,  and  (c)  510.4th 
seconds. 

Fig.  17.  The  kin  images  with  approximate  tumor  location  and  size  for  (a)  Case  1,  (b)  Case  2,  and  (c) 
Case  3,  respectively.  An  ellipse/circle  identifies  the  approximate  location,  and  size  of  the  tumor. 

Fig.  18.  The  histograms  of  kin  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  from  inside  (blue)  and  outside 
(gray)  the  tumor  region  (as  indicated  by  circular/elliptical  regions).  The  solid  lines  in  figures  show  the 
Gaussian  fit. 
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Fig.  19.  The  histograms  of  kout  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  from  inside  (blue)  and  outside 
(gray)  the  tumor  region  (as  indicated  by  circular/elliptical  regions).  The  solid  lines  in  figures  show  the 
Gaussian  fit. 

Fig.  20.  Solid  line  (blue)  shows  bulk  kin  rates  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  together  with  the 
histogram  fits.  The  dashed  (red)  line  indicates  the  Bayesian  minimum  error  classifier  threshold. 

Fig.  21.  Solid  line  (blue)  shows  bulk  rates  for  (a)  Case  1,  (b)  Case  2,  (c),  Case  3  together  with  the 
histogram  fits.  The  dashed  (red)  line  indicates  the  Bayesian  minimum  error  classifier  threshold. 
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Abstract — This  work  describes  the  characterization  efficiency 
of  optical  properties  of  breast  tumors  based  on  the  features 
obtained  using  in  vivo  near-infrared  (NIR)  spectroscopy  mea¬ 
surements.  Three  features,  relative  blood  concentration,  oxygen 
saturation  and  the  size  of  the  tumor,  are  used  to  diagnose  benign 
and  malignant  tumors.  The  performance  of  the  proposed  set  of 
features  are  evaluated  by  various  classifiers  using  data  acquired 
from  44  patients  with  malignant  tumors,  and  72  patients  with 
benign  tumors.  The  area  under  the  receiver  operating  character¬ 
istics  (ROC)  curve  of  the  scaled  nearest  mean  classifier  (NMSC) 
using  the  three  features  yields  a  value  of  0.91  with  a  significance 
level  of  0.05.  Our  results  suggest  that  the  features,  relative  blood 
concentration,  and  oxygen  saturation  can  differentiate  breast 
tumors  with  a  relatively  high  precision. 

I.  Introduction 

American  cancer  society  (ACS)  estimates  that  a  total  of 
approximately  200,000  new  cases  of  invasive  breast  cancer 
occur  in  women  in  the  United  States  every  year  [1].  Currently, 
there  are  over  2  million  women  living  in  the  US  who  have  been 
diagnosed  with  and  treated  for  breast  cancer.  A  total  of  40,410 
women  and  470  men  are  predicted  to  die  from  breast  cancer  in 
the  US  during  the  year  2007  as  per  ACS  estimates  [1].  Breast 
cancer  continues  to  be  the  leading  cancer  site  among  American 
women.  Early  detection  is  critical  for  effective  treatment  of 
breast  cancer.  Patients  with  tumors  1  cm  or  less  in  size  have 
a  greater  than  90  percent  long-term  survival  [2], 

In  recent  years,  there  has  been  considerable  interest  in 
near-infrared  (NIR)  optical  spectroscopy  and  tomography  tech¬ 
niques  since  they  provide  contrast  information  that  is  specific 
to  oxyhemoglobin,  deoxyhemoglobin,  and  water  which  can 
potentially  be  used  for  early  detection  and  diagnosis  of  breast 
cancer  [3]— [8]. 

Correct  interpretation  of  the  optical  indices  (i.e.  de¬ 
oxyhemoglobin,  oxyhemoglobin,  blood  volume,  water  con¬ 
tent,  scattering,  and  absorption)  obtained  by  optical  spec¬ 
troscopic/tomographic  techniques  is  also  important  as  well 
as  acquiring  them.  Several  research  groups  demonstrated 
that  the  contrast  in  optical  indices  can  provide  information 
that  allows  for  better  characterization  of  breast  cancer  [3]- 
[5],  In  [3],  Pogue  et  al.  presented  a  way  to  measure  and 
obtain  hemoglobin  concentration,  oxygen  saturation,  water 
fraction,  scattering  power,  and  scattering  amplitude.  These 
indices  were  then  investigated  for  the  differences  between 
healthy  and  diseased  breast  tissues.  In  [4],  Grosenick  et  al. 
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reported  on  the  optical  indices,  scattering  and  absorption 
coefficients,  hemoglobin  concentration,  and  blood  oxygen 
saturation  obtained  using  optical  measurements.  Their  results 
showed  that  these  optical  indices  can  be  used  to  distinguish 
carcinomas  from  healthy  breast  tissues.  Recently,  Khayat  et 
al.  [5]  presented  characterization  results  of  optical  indices, 
oxyhemoglobin,  deoxyhemoglobin,  blood  volume,  lipid  and 
water  content,  scattering  and  absorption  coefficients,  using 
optical  imaging.  The  results  showed  the  ability  of  optical 
imaging  to  characterize  different  types  of  breast  lesions. 

In  this  work,  we  evaluated  the  characterization  efficiency 
of  optical  properties  of  breast  tumors  using  in  vivo  data 
obtained  by  near-infrared  (NIR)  spectroscopy.  Our  evaluation 
criteria  is  based  on  statistical  classification  techniques.  Three 
features,  namely,  relative  blood  concentration,  A BV,  oxygen 
saturation,  A  Deoxy,  and  the  size  of  the  tumor,  S,  were  used 
to  characterize  benign  and  malignant  tumors.  The  performance 
of  the  proposed  set  of  features  were  evaluated  using  various 
classifiers  on  44  patients  with  malignant  tumors,  and  72 
patients  with  benign  tumors.  The  area  under  the  receiver 
operating  characteristics  (ROC)  curve  of  the  scaled  nearest 
mean  classifier  (NMSC)  using  the  three  features  yields  a  value 
of  0.91  with  a  significance  level  of  0.05. 

The  rest  of  the  paper  is  organized  as  follows:  In  Section 
II,  we  present  the  NIR  apparatus,  and  data  protocol,  followed 
by  feature  extraction  and  tumor  classification.  In  Section  III, 
we  present  statistical  analysis  of  clinical  data.  Section  IV 
summarizes  our  results. 

II.  Methods 

A.  Apparatus 

In  this  study,  a  continuous  wave  (CW)  near  infrared  spec¬ 
trometer  (NIRS)  is  used  [9],  The  apparatus  includes  a  probe 
(Fig.  2).  In  the  center  of  the  probe  there  is  a  3 -wavelength 
light  emitting  diode  (LED).  The  probe  consisted  of  one  multi¬ 
wavelength  LED  as  a  light  source  and  8  silicon  diodes  as 
detectors.  The  detectors  surround  the  LED  with  a  4  cm  radius. 
The  light  intensity  from  the  detectors  was  adjusted  to  be 
approximately  1  volt  and  calibrated  with  a  phantom  with 
known  absorption  and  scattering  coefficients. 


Fig.  1 .  The  NIR  probe  with  a  multi-wavelength  LED  and  8  silicon  diodes 
as  detectors. 

B.  Patients  and  Protocol 

This  study  includes  two  centers,  namely,  the  Abramson 
Family  Cancer  Research  Institute,  Department  of  Radiology 
of  the  Hospital  of  University  of  Pennsylvania  (HUP),  and  the 
Department  of  Gynecology  of  Leipzig  University  (DGLU). 
HUP  provided  24  patients  with  malignant  and  64  patients 
benign  tumors.  DGLU  provided  20  patients  with  malignant 
and  6  benign  tumors. 

The  measurements  are  taken  on  the  breast  with  tumor.  Then, 
the  probe  is  transferred  to  the  tumor  free  contralateral  breast  to 
include  the  mirror  image  location  of  the  suspected  cancer.  The 
sensors  giving  the  largest  changes  with  respect  to  the  mirror 
image  position  on  the  contralateral  breast  are  related  to  the 
suspected  cancer. 

C.  Feature  Extraction 

In  this  study,  three  features,  namely,  relative  blood  concen¬ 
tration,  A BV,  oxygen  saturation,  A Deoxy,  and  the  size  of 
the  tumor,  S,  are  used. 

The  features,  A  BV,  and  A  Deoxy  are  obtained  using 

A  OD  =  sA  CL  (1) 

where  OD  is  the  optical  density,  £  is  the  extinction  coefficient, 
C  is  blood  concentration,  L  is  the  mean  pathlength  of  photons, 
and  A  denotes  relative  change.  Here,  £  ss  1  cm-1,  and  L  1 
cm  for  a  pathlength  factor  of  5. 

Following  (1),  the  relative  blood  concentration,  A  BV,  and 
the  oxygen  saturation,  A  Deoxy,  can  be  approximated  at  two 
different  wavelengths  by 

A  BV  ex  0.3AOD73o  +  A  ODS50  (2) 

ADeoxy  oc  I.3AO.D730  +  AODgso  (3) 

where  AOD730,  and  AOD^q  denote  the  relative  changes  in 
optical  density  at  730  nm  and  850  nm,  respectively. 

A  BV  and  ADeoxy  can  also  be  approximated  by 

ABV  ex  A  [Hb]  +  A  [ Hb02 }  (4) 

ADeoxy  ex  A[iT602]  —  A[Hb]  (5) 

where  A [Hb\ ,  and  A[iT602]  denote  the  relative  change  in 
deoxyhemoglobin  (Hb)  and  oxyhemoglobin  (Hb02). 


The  concentrations  of  Hb,  and  Hb02  in  (4)  and  (5)  are 
calculated  by  the  Beer- Lambert  Law  given  by 

A  OD  =  log-j  (6) 

where  I  is  light  intensity  after  absorption  and  scattering,  and 
I0  is  the  baseline  light  intensity  obtained  from  the  contralateral 
breast,  using  known  extinction  coefficients  of  Hb,  Hb02  and 
differential  pathlength  factors  [10]. 

Here,  it  is  important  to  note  that,  ABV ,  and  ADeoxy 
values  are  based  on  a  lipid  blood  oxygen  model.  Thus  the 
increments  of  BV  and  Deoxy  are  relative  to  the  contralateral 
breast: 

ABV  =  A  BVtumor  —  ABVcontra  (7) 

ADeoxy  =  ADeoxytUmor  ~  A Deoxy contra  (8) 

where  A  BVtumor,  ABVcontra  are  relative  blood  volume  in  the 
tumor  breast  and  the  mirror  image  position  of  the  contralateral 
breast,  respectively,  and  ADeoxytumor ,  ADeoxycontra  are 
relative  oxygen  saturation  in  the  tumor  breast  and  the  mirror 
image  position  of  the  contralateral  breast,  respectively. 

D.  Feature  Analysis  and  Tumor  Classification 

In  this  subsection,  we  present  the  set  of  tumor  classifi¬ 
cation  features,  and  the  malignancy  differentiation  criteria. 
FI  denotes  ABV,  F2  denotes  ADeoxy,  and  F3  denotes,  S, 
size  of  the  tumor.  We  evaluate  the  malignancy  differentiation 
capability  of  the  individual  features  and  various  combinations 
of  the  these  features  using  a  set  of  classifiers,  namely,  k-nearest 
neighbor  classifier  (KNNC),  Parzen  density  based  classifier 
(PAR),  automatic  neural  network  classifier  (NEURC),  normal 
densities  based  linear  classifier  (LDC),  nearest  mean  classifier 
(NMC),  scaled  nearest  mean  classifier  (NMSC),  normal  den¬ 
sities  based  quadratic  classifier  (QDC),  uncorrelated  normal 
densities  based  quadratic  classifier  (UDC).  The  more  details 
information  on  these  classifiers  can  be  found  in  [12], 

We  evaluated  the  malignancy  differentiation  capability  of 
the  following  individual  and  combined  features: 

FI:  ABV 
F2  :  ADeoxy 
F3  :  Tumor  Size  ( S ) 

F1-F2:  ABV  and  ADeoxy 
F1-F2-F3:  ABV,  ADeoxy,  and  S 

III.  Statistical  Analysis  of  Clinical  Data 

The  evaluation  is  based  on  receiver  operating  characteristics 
(ROC)  methodology.  The  ROC  curve  is  obtained  by  plotting 
the  probability  of  false  positive  rate  versus  the  probability 
of  detection.  The  evaluation  of  classification  method  is  done 
using  area  under  the  ROC  curve  (AUC).  First,  we  evaluated  the 
classification  performance  of  all  three  features.  Table  I  presents 
the  AUC  values  for  8  different  classifiers  for  all  three  features. 
The  NMSC  has  the  best  performance  in  terms  of  classification 
with  a  AUC  value  of  0.9098  followed  by  the  Parzen  classifier 
with  a  AUC  value  of  0.9041. 


TABLE  I 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  FLF2-F3:  A BV,  ADeoxy,  S 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9098 

0.9041 

0.9017 

0.8984 

0.8864 

0.8843 

0.8807 

0.8752 

TABLE  II 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F1-F2:  A  BV ,  ADeoxy 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9001 

0.8993 

0.8930 

0.8908 

0.8992 

0.8821 

0.8782 

0.8645 

Fig.  2.  Scaled  Nearest  Mean  Classifier  and  F1-F2  2-D  data  clustering. 


Next,  we  evaluated  the  performance  of  the  two  features  mea¬ 
sured  by  NIR  spectroscopy.  Table  II  presents  the  AUC  values 
for  8  different  classifiers  for  features  A  BV  and  ADeoxy. 
Again,  the  NMSC  performed  the  best  in  terms  of  classification 
with  a  AUC  value  of  0.9001.  Finally,  we  evaluated  the 
individual  classification  performances  of  the  three  features. 
Table  III  presents  the  AUC  values  for  8  different  classifiers 
for  the  feature  A  BV.  The  NMC  has  the  best  performance 
in  terms  of  classification  with  a  AUC  value  of  0.8832.  Table 
IV  presents  the  results  8  different  classifiers  for  the  feature 
ADeoxy.  The  NMC  has  the  best  performance  in  terms  of 
classification  with  a  AUC  value  of  0.879.  Table  V  presents 
the  results  8  different  classifiers  for  the  feature  S.  The  QDC 
has  the  best  performance  in  terms  of  classification  with  a  AUC 
value  of  0.5612. 

As  it  can  be  seen  from  Tables  I,  and  II,  the  best  performing 
feature  set  is  the  combination  of  the  three  features.  We  can 
also  conclude  from  Table  V  that,  the  tumor  size  can  not  be 
used  to  differentiate  healthy  and  diseased  tissues  with  an  AUC 
value  of  around  0.5.  However,  the  combination  set  of  optical 
indices,  obtained  using  optical  measurements,  can  differentiate 
breast  tumors  with  a  relatively  high  precision  with  a  AUC 
value  of  0.9.  Similarly,  optical  indices,  A  BV  and  ADeoxy , 
also  performed  well  with  AUC  values  of  0.883  and  0.879, 
respectively. 


Fig.  3.  Parzen  and  F1-F2  2-D  data  clustering. 


Figures  2,  and  3  show  the  distribution  of  features  A  BV,  and 
ADeoxy  extracted  from  benign  and  malignant  tumors.  The 
thresholds  were  computed  using  scaled  nearest  mean  classifier 
and  Parzen  classifiers.  Figure  4  presents,  the  ROC  curves  for 
all  three  features,  and  the  best  two  features,  namely,  ADeoxy 
and  A  BV.  The  observed  area  under  the  ROC  curve  for  Fl- 
F2-F3,  and  F1-F2  are  0.9098,  and  0.9001,  respectively.  Figure 
5  presents  the  ROC  curves  for  individual  features  FI,  and  F2 
using  the  nearest  mean  classifier.  The  observed  area  under  the 
ROC  curve  for  FI,  and  F2  are  0.8832  and  0.8790,  respectively. 

IV.  Conclusion 

In  this  work,  we  evaluated  the  characterization  efficiency 
NIR  optical  spectroscopy  using  three  features,  relative  blood 
concentration,  oxygen  saturation,  and  the  size  of  the  tumor. 
The  characterization  of  malignant  and  benign  tumors  are 
evaluated  using  different  classifiers.  Our  results  suggest  that 
the  relative  blood  concentration,  and  oxygen  saturation  has 
potential  to  differentiate  malignant  and  benign  breast  tumors 
with  a  relatively  high  accuracy.  This  set  of  features  can 
potentially  be  incorporated  into  a  diagnostic  systems  to  aid 
physicians  for  breast  cancer  diagnosis.  In  the  near  future,  we 
will  incorporate  additional  features  to  the  current  feature  set. 
We  plan  to  analyze  the  new  set  of  features  using  different 


Probability  of  Detection  Probability  of  Detection 


TABLE  III 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  FI:  A BV 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8817 

0.8764 

0.8807 

0.8779 

0.8513 

0.8778 

0.8832 

0.8302 

TABLE  IV 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F2:  A Deoxy 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8787 

0.8764 

0.8776 

0.8711 

0.8491 

0.8613 

0.8790 

0.8331 

TABLE  V 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F3:  S 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.5123 

0.5292 

0.4782 

0.5429 

0.5382 

0.5612 

0.5112 

0.4827 

;.  4.  ROC  curves  for  F1-F2-F3  and  F1-F2  using  NMSC  Classifier. 


Fig.  5.  ROC  curves  for  FI  and  F2  using  NMC  Classifier. 


sampling  techniques,  i.e.  hold-out,  leave-one-out,  resubstitu¬ 
tion  techniques,  and  further  investigate  the  characterization 
efficiency  optical  features.  We  also  plan  to  compare  the  ROC 
performance  of  the  optical  features,  FI  and  F2,  with  that  of  F3, 
using  a  hypothesis  testing  method  based  on  the  AUC  statistics. 
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Abstract — In  this  work,  we  propose  a  method  to  reconstruct 
spatially  resolved  pharmacokinetic  rate  images  of  fluorescence 
agents  directly  from  the  boundary  photon  flux  measurements. 
We  use  a  compartmental  modeling  scheme  to  model  the  phar¬ 
macokinetics  of  fluorescence  agents.  We  coupled  this  model  with 
the  fluorescence  diffuse  optical  tomography  (FDOT)  forward 
model  to  form  a  state  space  model  which  is  then  iteratively 
solved  by  extended  Kalman  filtering  (EKF)  algorithm.  As  an 
example,  we  used  a  two-compartment  model  for  indocyanine 
green  (ICG)  pharmacokinetics.  To  validate  the  proposed  method, 
we  tested  our  approach  using  a  simulation  study.  Reconstructed 
pharmacokinetic  rate  images  with  correct  localization  of  het¬ 
erogeneities  and  high  reconstruction  accuracy  show  that  the 
proposed  algorithm  can  be  used  for  tumor  detection,  tumor 
diagnosis,  drug  delivery  and  feasibility  studies. 

I.  Introduction 

Development  of  uorescence  agents  that  emit  light  upon 
excitation  lead  to  a  new  imaging  technique,  called  uorescence 
diffuse  optical  tomography  (FDOT),  whose  theory  is  a  straight¬ 
forward  extension  of  diffuse  optical  tomography.  In  FDOT, 
near  infrared  (NIR)  excitable  exogenous  uorescence  agents, 
which  are  externally  injected  into  the  tissue  are  investigated 
in  terms  of  their  optical  parameters  [1],  These  agents  enhance 
the  sensitivity  of  detection  hence  increase  the  optical  contrast. 
FDOT  has  the  ability  to  characterize  functional  parameters  of 
tissue  such  as  scattering  and  absorption,  which  can  be  used  to 
derive  pharmacokinetics  of  uorescence  agents. 

The  analysis  of  pharmacokinetics  of  uorescence  agents 
is  a  potential  means  for  tumor  detection,  diagnosis,  drug 
delivery  and  feasibility  studies  [2],  One  approach  to  phar¬ 
macokinetic  analysis  is  the  compartmental  modeling  [6].  In 
this  method,  a  region  of  interest  consists  of  a  number  of 
compartments,  generally  representing  a  volume  of  similar 
tissues  into  which  the  uorescence  agent  is  distributed.  The 
concentration  changes  in  a  speci  c  compartment  are  modeled 
as  a  result  of  the  exchange  of  uorescence  agents  between 
connected  compartments.  These  changes  are  modeled  by  a 
collection  of  coupled  ordinary  differential  equations  (ODE); 
each  equation  describing  the  time  change  dictated  by  the  bio¬ 
logical  laws  that  govern  the  concentration  exchanges  between 
the  interacting  compartments.  Coefcients  of  the  ODE’s  are 


the  pharmacokinetic  rates  that  represent  rates  of  exchange 
between  different  compartments. 

A  number  of  research  groups  reported  compartmental  mod¬ 
eling  of  uorescence  agents  for  tumor  diagnosis  in  animal 
and  human  subjects  [3]— [5],  In  these  studies,  bulk  values 
that  represent  pharmacokinetic  rates  either  over  the  entire 
imaging  domain  or  a  large  volume  is  used  for  tumor  de¬ 
tection/diagnosis.  In  such  cases,  spatially  resolved  represen¬ 
tation  of  pharmacokinetic  rates  may  increase  speci  city  and 
sensitivity  compared  to  average  bulk  rates.  Moreover,  there 
are  studies  in  literature  showing  that  the  spatially  varying 
pharmacokinetics  rates  are  quantitatively  different  inside  and 
outside  the  tumor  region  [7]— [9]. 

In  this  paper,  we  proposed  a  compartmental  modeling 
scheme  for  uorescence  agent  pharmacokinetics,  and  recon¬ 
structed  spatially  resolved  pharmacokinetic  rate  images  di¬ 
rectly  from  the  boundary  photon  ux  measurements.  We 
also  reconstructed  the  spatially  resolved  uorescence  agent 
concentration  images  inside  different  compartments.  In  our 
algorithm,  we  used  a  nite  element  method  (FEM)  [10]  as  a 
forward  solver  and  an  extended  Kalman  Iter  (EKF)  algorithm 
[3]  as  an  inverse  solver.  We  tested  our  approach  using  a 
simulated  data  set  based  on  a  two-compartment  model  for  ICG 
pharmacokinetics.  For  the  simulation  studies,  we  generated 
a  set  of  time  series  data  from  a  domain  with  tissue-like 
characteristics  using  FEM  algorithm  to  solve  the  coupled 
diffusion  equations  with  Robin  type  boundary  conditions.  Re¬ 
constructed  pharmacokinetic  images  with  correct  localization 
of  heterogeneities  and  high  reconstruction  accuracy  show  that 
the  proposed  algorithm  can  be  useful  for  tumor  diagnosis,  drug 
delivery  and  feasibility  studies. 

The  rest  of  the  paper  is  organized  as  follows:  In  Section 
II,  we  present  pharmacokinetic  analysis  and  compartmental 
modeling  for  uorescence  agents.  In  Section  III,  we  present 
FDOT  forward  problem.  Section  IV  describes  the  inverse 
problem  and  the  EKF  algorithm  for  direct  reconstruction  of 
pharmacokinetic  parameters.  In  Section  V,  we  present  the 
simulation  results.  Section  VI  summarizes  our  results  and 
conclusion. 


II.  Pharmacokinetic  Modeling  and 

COMPARTMENTAL  MODEL  ANALYSIS 

In  general,  the  continuous  time  state-space  representation 
for  an  rc-compartment  model  is  given  by 

C(i)  =  K(a„)C(i)  (1) 

m(t)  =  V(a„)C(f)  (2) 

where  C(t)  denotes  the  concentration  vector  whose  elements 
are  the  concentrations  of  the  uorescence  agents  in  different 
compartments,  and  C (t)  denotes  its  time  derivative.  m(t)  is 
the  bulk  uorescence  concentration,  n  is  the  number  of  com¬ 
partments,  an  is  the  parameter  vector  whose  elements  are  the 
pharmacokinetic  rate  constants  and  volume  fractions.  K(an) 
is  the  system  matrix  with  entries  being  the  pharmacokinetic 
rates,  and  V(an)  is  the  vector  containing  the  volume  fractions 
[6], 

Although  the  formulation  for  the  direct  reconstruction 
of  pharmacokinetic  rate  images  can  be  applicable  for  n- 
compartment  models,  here  the  two-compartment  model  for 
ICG  pharmacokinetics  will  be  our  running  example  [3],  [4]. 
Based  on  a  two-compartment  model  for  ICG  kinetics,  as 
shown  in  Figure  1,  the  ICG  transition  between  two  compart¬ 
ments  can  be  modeled  by  two  coupled  differential  equations: 

— ^  =  —koutCe(t)  +  kinCp(t),  (3) 

— =  +  keim)Cp(t)  +  koutCe(t).  (4) 

Here,  C(f)  in  (1)  and  (2)  includes,  Cp(t)  and  Ce(f),  repre¬ 
senting  the  ICG  concentrations  in  the  plasma  and  extracellular 
extravascular  space  (EES),  respectively.  The  parameter  vector, 

o?2,  includes,  />’; ,, ,  k0ut ,  keim ,  Vp ,  and  ue,, 

^2  [kin  kout  kelm  Vp  tle]  ,  (5) 

where  km  and  kout  are  the  pharmacokinetic  rates  that  govern 
the  leakage  into  and  the  drainage  out  of  the  EES,  keim 
describes  the  ICG  elimination  from  the  body  through  kidneys 
and  livers,  and  vp  and  ve  are  the  plasma  and  EES  volume 
fractions,  respectively. 


Fig.  1.  Block  diagram  of  the  two-compartment  model  for  ICG  pharmacoki¬ 
netics. 

Here,  to  obtain  2-D  images  of  pharmacokinetic  rates  and 
uorescence  agent  concentrations  in  different  compartments, 


we  extend  the  compartmental  model  equations  (1)  and  (2)  to 
spatially  resolved  model  as  follows: 

C(ri,f)  K(a„(n))  0  0  C(ri,f) 

:  =  0  0  : 

_C(rjv,  t)_  _  0  0  K(a„(rAr))_  _C(rN,t)_ 

+  w(r  ,t)  (6) 

where  u>(r,t)  is  uncorrelated  zero  mean  Gaussian  processes 
with  covariance  matrix  Q,  representing  the  small  deviations 
resulting  from  model  mismatch,  r  =  [ri  r2-...rjv],  and  r3  is 
the  location  of  the  jth  voxel  for  j  =  1,2,3 ...N,  N  being  the 
number  of  total  voxels  in  the  discritized  domain. 

|"  Tn(r- t,  f)  ’V(an(ri))  0  0  C(ri,f) 

:  =  0  0  : 

m(rN,t)_  _  0  0  V(a„(rjv))_  _C(rN,t)_ 

+  V(r,t)  (7) 

where  r/(r,  t)  is  a  zero  mean  Gaussian  process  with  covariance 
matrix  R  presenting  the  noise  in  the  measurements. 

The  implicit  form  of  (6)  and  (7)  are  given  by: 

C(r,f)  =  K(an(r))C(r,f)  +  w(r,f)  (8) 

m(r,  t)  =  V(an(r))C(r,  t)  +  rj(  r,  t)  (9) 

The  quantity  we  wish  to  reconstruct  is  the  spatially  varying 
pharmacokinetic  rate  parameters  instead  of  the  absorption 
coef  cient  of  the  orescence  agents  as  commonly  the  case  in 
FDOT  [1],  To  do  this,  we  rst  need  to  develop  a  mapping 
which  relates  the  boundary  ux  measurements  to  the  bulk 
orescence  agent  concentrations  in  tissue.  This  mapping  can 
be  obtained  by  using  FDOT  forward  model  which  will  be 
explained  in  the  next  section. 

III.  Fluorescence  Diffuse  Optical  Tomography 

We  used  a  coupled  system  of  diffusion  equations  to  model 
uorescence  light  propagation  in  tissue  [1],  Based  on  the 
coupled  diffusion  equations,  the  forward  model  for  FDOT  can 
be  expressed  as: 

(10) 

where  na f(r,t)  =  [fiaf(ri,t),  ■  ■  ■ ,  /J,af(rN,  t)]  is  the  time 
dependent  absorption  coef  cient  of  the  uorescence  agent, 
\&(r,  t)  =  [4/ (t*i ,  i) ,  •  *  • ,  4/(rjv,f)]  is  the  time-dependent 
boundary  ux  measurements,  and  /  is  a  nonlinear  function 
de  ned  by  the  coupled  diffusion  equation.  Under  the  assump¬ 
tion  that  the  exogenous  optical  properties  has  no  effect  on 
endogenous  optical  properties,  (10)  can  be  linearized  to  obtain 

V(r,t)=Wnaf(r,t),  (11) 

where  W  is  the  weight  matrix  which  maps  the  absorption 
coef  cients  to  the  boundary  measurements  obtained  using  the 
FEM  algorithm  [10]. 

The  absorption  coef  cient  of  the  uorescence  agents  is 
related  to  the  bulk  uorescence  agent  concentration  as  follows: 

Ma/(M)  =  em(  M)  (12) 


where  e  is  the  extinction  coef  cient  of  the  uorescence  agent 
at  the  excitation  wavelength. 

A.  Pharmacokinetic  Rate  to  Measurement  Map  for  Fluores¬ 
cence  Agents 

To  derive  a  complete  formulation  to  reconstruct  the  phar¬ 
macokinetic  parameters  and  concentrations  in  different  com¬ 
partments  using  the  photon  ux  measurements,  we  combined 
the  FDOT  forward  problem  equations,  (10)  or  (11)  with  the 
compartmental  model  equations. 

Combining  (9)  and  (10),  with  the  knowledge  of  the  linear 
relationship  between  the  absorption  coef  cient  and  the  bulk 
uorescence  concentration,  (12),  nonlinear  FDOT  forward 
problem  is  related  to  the  bulk  uorescence  agent  concentration 
as: 

t)  =  g{ m(r,  t))  ~  sr(V(o:n(r))C(r,  t))  (13) 

where  g  is  the  non-linear  function  which  relates  the  bulk 
uorescence  agent  concentrations  to  the  boundary  ux  mea¬ 
surements. 

The  linearized  version  of  (11)  is  given  as: 

M/  (r,  i)  =  tWm(r,() 

=  eWV(a„(r))C(r,t)  +  eWr)(r,t) 

=  rC(r,t)  +  e\Vr](r,t),  (14) 

where  W,  V  and  77  (r,  t)  are  de  ned  as  above  and  V  = 
eWV(a„(r)). 

The  equations  (8)  and  (13),  which  combine  the  FDOT 
forward  problem  with  the  compartment  model  equations  con¬ 
stitute  the  set  of  equations  which  will  be  used  for  the  direct 
reconstruction  of  pharmacokinetic  rates,  volume  fractions,  and 
uorescence  concentrations  in  different  compartments. 

IV.  Direct  Reconstruction  of  Pharmacokinetic 
Rate  and  Concentration  Images  from  Optical 
Flux  Measurements 

The  source  detector  measurements  in  (9)  are  collected  at 
discrete  time  instances,  t  =  kT,  k  =  0, 1, ...,  where  T  is  the 
sampling  period.  Therefore,  the  continuous  model  described 
in  (8)  and  (14)  has  to  be  discretized.  To  simplify  our  notation, 
we  shall  use  C(r,  k)  =  C(r,  kT)  and  \P(r,  k)  =  \P(r,  kT). 

Let  Bn  denote  the  discrete-time  parameter  vector  of  the 
pharmacokinetic  rates  and  volume  fractions.  The  parameter 
vector  6n  (r)  can  be  either  time  dependent  or  time  independent. 
The  formulation  given  in  this  work  can  be  used  for  both 
cases.  In  our  case,  the  pharmacokinetic  rates  and  the  volume 
fractions  are  time  independent.  However,  in  order  to  estimate 
6n  within  the  EKF  framework,  the  following  dynamic  model 
is  introduced: 


0„(r,  k  +  1)  =  0n(  r,  k)  +  <?(r,  k),  (15) 

where  <;(r,  k)  is  a  zero  mean  white  noise  process  with  covari¬ 
ance  matrix  S.  The  details  of  the  dynamic  model  introduced 
for  the  joint  estimation  of  the  system  parameters  and  the  states 
can  be  found  in  [3]. 


1 )  A  priori  information  for  Pharmacokinetic  Rates  and 
Volume  Fractions:  To  improve  the  robustness  of  estimates  of 
the  parameters,  we  impose  a  priori  information  on  pharma¬ 
cokinetic  parameters  and  volume  fractions.  Here,  we  assume 
that  there  is  no  information  about  the  tumor  structure  and  used 
a  4-pixel  neighborhood  model. 

Using  the  4-pixel  neighbor  model  with  equal  weights  (3,  the 
random  process,  0„(r,  k),  can  be  modeled  as: 


4 

9n(rj,k  + 1)  =  (36  n (rj ,  k)  +  (3 ^  dn (r jj ,  k)  +  <?(r , k) .  (16) 

i= 1 


2)  Extended  Kalman  Filter  (EKF)  Formulation:  In  our 
state-space  model,  (8)  and  (9),  both  the  states  (concentrations) 
and  system  parameters  (pharmacokinetic  rates  and  volume 
fractions)  are  unknown.  In  this  case,  the  state-space  model  can 
be  regarded  as  a  non-linear  model  in  which  system  parameters 
and  states  are  combined  to  form  the  new  states  of  the  non¬ 
linear  model.  This  system  is  then  linearized  and  solved  for 
the  unknown  states  using  EKF  framework  [3].  To  solve  for 
the  concentrations  and  the  unknown  parameters,  the  parameter 
vector  Qn( r,  k)  is  inserted  into  the  concentration  vector  C(r,  k) 
as: 


C(r,  k  + 1) 

K(0„(r,fc))C(r,fc) 

Jw(r,  k) 

_On(rj,k+l)_ 

_(30n(rj,k)  +  (3j:l10n(rjUk)\ 

I  «(r,  k)  _ 

(17) 

The  measurement  equation  for  the  non-linear  case  is  given 
as: 


<&{r,k)  =  [  <?(V(an(r))C(r,  fc))  0 


C(r,  k) 
#n(r,  k) 


+g(r,  k). 


(18) 

Without  explicit  proof,  the  extended  Kalman  ltering  al¬ 
gorithm  for  simultaneous  estimation  of  concentrations  and 
parameters  for  the  nonlinear  case,  (13),  is  given  follows: 


'  C(r,  0)  ' 

■  E( C(r,0))  ' 

0„(r,O)  _ 

Mr,0) 

(19) 


Po,o  = 


Var(C(r,  0))  0 

0  sd  ’ 


(20) 


where  E  denotes  the  expected  value  of  C(r,  0),  P  is  the  error 
covariance  matrix,  and  S,/  is  the  preassigned  covariance  matrix 
of  the  unknown  system  parameters. 

The  following  equations  describe  how  the  concentration 
estimates  and  error  covariance  matrix  are  updated  at  the  kth 
time  instant  given  all  the  measurements  up  to  ( k  —  l)th  time 
instant.  For  k  =  1,2, ..., 


'  C(r,  k\k  —  1)  ' 

K(0n(r,  k  -  l))C(r,  k  -  1) 

Qn(rj,k\k  -  1)_ 

P0n(r j,  k  -  1)  +  On(rji,  k) 

~  .  (21) 

Pfc|fc-i  —  J  (k  —  l)Pfe-i,fe-iJT(fc  —  1) 


Q  d  0 
0  sd 


,  (22) 


where  J  is  the  Jacobian  matrix  given  by 


K(0„(r  ,*))  ^[K(0„(r,fc))C(r,fc)] 

■^dn(r,k)  -^J)n(r,k) 


(23) 


Gfc  =  Pfe,fc_1AT[APfc,fc_1AT  +  R]-1,  (24) 

where  Gfc  is  the  recursive  Kalman  gain,  R  is  the  covariance 
matrix  of  the  measurements  and  A  is: 

^g(V(On(r,k\k-l)C(r,k\k-l))  1T 
A  =  •  ,  (25) 

_  ^(^(V(^(r,fc|fc-l)C(r,fc|fc-l)  _ 

Pfc,fc  =  [I —  GfcAjPfe^-i.  (26) 


where  I  is  the  identity  matrix. 


'  CM)  ' 

C(r,  k\k  —  1) 

_&n(Tj ,  /c)_ 

J36n(rj,k\k  -  1)  +  ® n(rji,k\k  -  1)_ 

+Gfc(\P(r,  k)  -  g(V(0n( r,  k\k  -  l)C(r,  k\k  -  1)). 

V.  Simulations  and  Results 

To  validate  our  approach,  we  performed  a  simulation  study 
based  on  the  two-compartment  model  proposed  for  ICG.  Using 
physiologically  correct  values  for  pharmacokinetic  rates,  Ay„, 
kout,  helm,  and  volume  fractions,  ve,  vp,  around  the  tumor 
region,  a  set  of  time  series  data,  M>(r,  i),  was  generated 
from  a  simulated  domain  with  tissue-like  characteristics.  To 
generate  the  synthetic  measurements,  the  diffusion  equation 
was  solved  numerically  using  FEM  algorithm  with  Robin  type 
boundary  conditions  [10].  The  simulation  used  a  modulation 
frequency  300  MHz.  The  phantom  is  6cm  by  6cm  in  size,  and 
it  is  discretized  into  24  by  24  voxels  each  of  size  0.25cm 
by  0.25cm.  The  24  sources  and  24  detectors  are  arranged 


Fig.  2.  Pharmacokinetic  rate  images  of  kin  :  (a)  true,  and  (b)  reconstructed. 


+ 


Fig.  3.  Pharmacokinetic  rate  images  of  kout  •  (a)  true,  and  (b)  reconstructed. 


boundaries.  Figures  2a  and  3a  display  the  true  images  of 
pharmacokinetic  rates  kin  and  kout .  Figures  2b  and  3b  display 
the  corresponding  reconstructed  images.  We  observe  that  there 
is  a  good  agreement  between  the  true  and  the  estimated  images 
in  terms  of  localization  of  the  heterogeneities.  We  calculated 
the  percent  error  between  the  true  images  and  the  reconstructed 
images  by  the  ratio  of  the  Lr  norm  of  the  error  and  the  L 2 
norm  of  the  true  image.  The  percent  error  for  and  kout 
are  11.95  %  and  13.11  %,  respectively. 

VI.  Conclusion 

In  this  paper,  we  provided  a  formulation  for  the  recon¬ 
struction  of  spatially  resolved  pharmacokinetic  rate  images 
of  uorescence  agents  directly  from  the  boundary  photon 
ux  measurements.  We  performed  a  simulation  study  using 
a  numerical  phantom.  Reconstructed  images  with  small  errors 
show  that  the  algorithm  can  be  used  for  real  data  analysis.  In 
the  near  feature,  we  plan  to  apply  the  proposed  algorithm  to 
the  ICG  concentration  data  acquired  from  breast  tumors. 
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Abstract 

In  diffuse  optical  tomography  (DOT),  the  discretization  error  in  the  numerical 
solutions  of  the  forward  and  inverse  problems  results  in  error  in  the 
reconstructed  optical  images.  In  this  first  part  of  our  work,  we  analyse  the  error 
in  the  reconstructed  optical  absorption  images,  resulting  from  the  discretization 
of  the  forward  and  inverse  problems.  Our  analysis  identifies  several  factors 
which  influence  the  extent  to  which  the  discretization  impacts  on  the  accuracy  of 
the  reconstructed  images.  For  example,  the  mutual  dependence  of  the  forward 
and  inverse  problems,  the  number  of  sources  and  detectors,  their  configuration 
and  their  orientation  with  respect  to  optical  absorptive  heterogeneities,  and  the 
formulation  of  the  inverse  problem.  As  a  result,  our  error  analysis  shows  that 
the  discretization  of  one  problem  cannot  be  considered  independent  of  the  other 
problem.  While  our  analysis  focuses  specifically  on  the  discretization  error  in 
DOT,  the  approach  can  be  extended  to  quantify  other  error  sources  in  DOT  and 
other  inverse  parameter  estimation  problems. 


1.  Introduction 

Imaging  in  diffuse  optical  tomography  (DOT)  comprises  two  interdependent  stages  which 
seek  solutions  to  the  forward  and  inverse  problems.  The  forward  problem  is  associated  with 
describing  the  near-infrared  (NIR)  light  propagation,  while  the  objective  of  the  inverse  problem 
is  to  estimate  the  unknown  optical  parameters  from  boundary  measurements  [2], 

There  are  a  variety  of  factors  that  affect  the  accuracy  of  the  DOT  imaging,  such  as 
model  mismatch  (due  to  the  light  propagation  model  and/or  linearization  of  the  inverse 
problem),  measurement  noise,  discretization,  numerical  algorithm  efficiency  and  inverse 
problem  formulation.  In  this  two-part  study,  we  focus  on  the  effect  of  discretization  of  the 
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forward  and  inverse  problems.  In  the  first  part  of  our  work,  we  present  an  error  analysis  to  show 
the  effect  of  discretization  on  the  accuracy  of  the  reconstructed  optical  absorption  images.  We 
identify  the  factors  specific  to  the  imaging  problem,  which  determine  the  extent  to  which  the 
discretization  impacts  on  the  accuracy  of  the  reconstructed  optical  absorption  images.  In  the 
following,  first,  we  use  the  error  analysis  to  develop  novel  adaptive  discretization  algorithms 
for  the  forward  and  inverse  problems  to  reduce  the  error  in  the  reconstructed  optical  images 
resulting  from  discretization.  Next,  we  present  numerical  experiments  that  support  the  main 
results  of  part  I  and  demonstrate  the  effectiveness  of  the  developed  adaptive  mesh  generation 
algorithms. 

There  has  been  extensive  research  on  the  estimation  of  discretization  error  in  the  solutions 
of  partial  differential  equations  (PDEs)  [1,  5-7,  21,  22].  In  contrast,  relatively  little  has  been 
published  in  the  area  of  parameter  estimation  problems  governed  by  PDEs.  See  for  example 
[8]  for  an  a  posteriori  error  estimate  for  the  Lagrangian  in  the  inverse  scattering  problem 
for  the  time-dependent  acoustic  wave  equation  and  [19]  for  a  posteriori  error  estimates  for 
distributed  elliptic  optimal  control  problems.  In  the  area  of  DOT,  it  was  numerically  shown 
that  the  approximation  errors  resulting  from  the  discretization  of  the  forward  problem  can  lead 
to  significant  errors  in  the  reconstructed  optical  images  [3].  However,  an  analysis  regarding 
the  error  in  the  reconstructed  optical  images  resulting  from  discretization  has  not  been  reported 
so  far. 

In  this  work,  we  model  the  forward  problem  by  the  frequency-domain  diffusion  equation. 
For  the  inverse  problem,  we  focus  on  the  estimation  of  the  absorption  coefficient.  We  consider 
the  linear  integral  equation  resulting  from  the  iterative  linearization  of  the  inverse  problem 
based  on  Born  approximation  and  use  zeroth-order  Tikhonov  regularization  to  address  the 
illposedness  of  the  resulting  integral  equation.  We  use  finite  elements  with  first-order  Lagrange 
basis  functions  to  discretize  the  forward  and  inverse  problems  and  analyse  the  effect  of  the 
discretization  of  each  problem  on  the  reconstructed  optical  absorption  image.  Our  analysis 
describes  the  dependence  of  the  image  quality  on  the  optical  image  properties,  the  configuration 
of  the  source  and  detectors,  the  orientation  of  the  source  and  detectors  with  respect  to  absorptive 
heterogeneities,  and  on  the  regularization  parameter  in  addition  to  the  discretization  error  in  the 
solution  of  each  problem.  In  our  analysis,  we  first  consider  the  impact  of  the  inverse  problem 
discretization  when  there  is  no  discretization  error  in  the  solution  of  the  forward  problem,  and 
provide  a  bound  for  the  resulting  error  in  the  reconstructed  optical  image.  Next,  we  analyse 
the  effect  of  the  forward  problem  discretization  on  the  accuracy  of  the  reconstructed  image 
without  discretizing  the  inverse  problem,  and  obtain  another  bound  for  the  resulting  error  in 
the  reconstructed  optical  image.  We  see  that  each  error  bound  comprises  the  discretization 
error  in  the  corresponding  problem  solution,  scaled  spatially  by  the  solutions  of  both  problems. 
This  is  a  direct  consequence  of  the  fact  that  the  inverse  problem  solution  depends  on  the  model 
defined  by  the  forward  problem.  As  a  result,  the  error  analysis  yields  specific  error  estimates 
which  are  different  than  the  conventional  discretization  error  estimates  (see  equations  (3.8) — 
(3.9)  and  (4.14))  which  only  take  into  account  the  smoothness  and  support  of  the  function  of 
interest,  and  the  finite-dimensional  space  of  approximating  functions  [9].  We  further  discuss 
the  use  of  other  basis  functions  and  methods  in  the  discretization  of  the  forward  and  inverse 
problems  and  explain  how  the  error  bounds  can  be  modified  accordingly.  Finally,  we  extend 
our  analysis  to  show  the  effect  of  noise  on  the  accuracy  of  the  reconstructed  optical  images. 
Our  analysis  shows  that  the  presence  of  noise  results  in  error  terms  in  addition  to  the  error 
in  the  reconstructed  optical  images  induced  by  the  discretization  of  the  forward  and  inverse 
problems. 

This  work  not  only  provides  an  insight  into  the  error  in  reconstructed  optical  absorption 
images  resulting  from  discretization,  but  also  motivates  the  development  of  novel  adaptive 
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mesh  generation  algorithms  to  address  this  error  [14].  In  addition,  the  analysis  presented  in 
this  work  provides  a  means  to  identify  and  analyse  the  error  in  the  reconstructed  optical  images 
resulting  from  the  linearization  of  the  Lippmann-Schwinger-type  equations  [10]  using  Born 
approximation  [15].  Furthermore,  the  error  analysis  introduced  in  this  paper  is  not  limited 
to  DOT,  and  can  easily  be  extended  for  use  in  similar  inverse  parameter  estimation  problems 
such  as  electrical  impedance  tomography,  bioluminescence  tomography,  optical  fluorescence 
tomography,  microwave  imaging  etc,  in  all  of  which  the  inverse  problem  can  be  interpreted 
in  terms  of  a  linear  integral  equation,  whose  kernel  is  the  solution  of  a  PDE  that  models  the 
forward  problem. 

The  outline  of  this  paper  is  as  follows:  section  2  defines  the  forward  and  inverse  problems. 
In  section  3,  we  discuss  the  discretization  of  the  forward  and  inverse  problems.  In  section  4, 
we  present  two  theorems  that  summarize  the  impact  of  discretization  on  the  accuracy  of  the 
reconstructed  optical  images,  which  is  followed  by  section  5.  The  appendices  include  results 
regarding  the  boundedness  and  compactness  of  the  linear  integral  operator  used  to  define  the 
inverse  problem,  and  the  proof  for  the  convergence  of  the  inverse  problem  discretization. 

2.  Forward  and  inverse  problems 

In  this  section,  we  describe  the  model  for  NIR  light  propagation  and  define  the  forward  and 
inverse  DOT  problems.  Table  1  provides  a  list  of  the  notation  and  table  2  provides  the  definition 
of  function  spaces  and  norms  used  throughout  the  paper.  We  note  that  we  use  calligraphic 
letters  to  denote  the  operators,  e.g.  Aa,1,K  etc. 

2.1.  Forward  problem 

We  use  the  following  boundary  value  problem  to  model  the  NIR  light  propagation  in  a  bounded 
domain  £2  C  R3  with  Lipschitz  boundary  <)Q  [2,  9]: 


-V  •  D(x)Vgj(x)  +  (/rfl(x)  +  y)  gj(x)  =  QjQQ 

x  e  £2, 

(2.1) 

gi(x)  +  2aD(x)^-(x)  —  0  x  e  dtt, 

3  n 

(2.2) 

where  gj  (x)  is  the  photon  density  at  x,  Qj  is  the  point  source  located  at  xs; ,  j  =  1 ,  ,NS, 
where  Ns  is  the  number  of  sources,  Z)(x)  is  the  diffusion  coefficient  and  pa  (x)  is  the  absorption 
coefficient  at  x,  i  =  V—  T,  u>  is  the  modulation  frequency  of  the  source,  c  is  the  speed  of  the 
light,  a  =  (I  +  R)/(\  —  R)  where  R  is  a  parameter  governing  the  internal  reflection  at  the 
boundary  ()Q.  and  3  •  /3 n  denotes  the  directional  derivative  along  the  unit  normal  vector  on 
the  boundary.  Note  that  we  assume  the  diffusion  coefficient  is  isotropic.  For  the  general 
anisotropic  material,  see  [17]. 

The  adjoint  problem  [2]  associated  with  (2. 1) — (2.2)  is  given  by  the  following  boundary 
value  problem: 

-V  •  D(x)Vg*(x)  +  (/rfl(x)  -  g*(x)  =  0  xell,  (2.3) 

dg* 

g*(x)  +  2a D (x) (x)  =  g*(x)  x  e  (2.4) 

on 

where  Q*  is  the  adjoint  source  located  at  x'd,i  =  I .  . . . ,  /V,/ ,  where  N,i  is  the  number  of 
detectors.  We  note  that  we  approximate  the  point  source  Qj  in  (2.1)  and  the  adjoint  source 
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Table  1.  Definition  of  variables  functions,  and  operators. 

Notation  Explanation 

£2  Bounded  domain  in  R3  with  Lipschitz  boundary 

3  £2  Lipschitz  boundary  of  £2 

x  Position  vector  in  £2  U  3  £2 

gj  (x)  Solution  of  the  diffusion  equation  at  x  for  the  j th 
point  source  located  at  x{ 

g*  (x)  Solution  of  the  adjoint  problem  at  x  for  the  / th 
adjoint  source  located  at  x'd 
Gj  (x)  Finite-element  approximation  of  gj  at  x 

G*  (x)  Finite-element  approximation  of  g*  at  x 

ej  (x)  The  discretization  error  at  x  in  the  finite-element 
approximation  of  gj 

e*  (x)  The  discretization  error  at  x  in  the  finite-element 
approximation  of  gf 

a(x)  Small  perturbation  over  the  background  /ia  at  x 

r ij  Differential  measurement  at  the  i  th  detector 

due  to  the  j  th  source 

Aa  The  matrix-valued  operator  mapping  a  e  L°°(£ 2)  to  <CNdxNs 

A*  The  adjoint  of  Aa  mapping  from  CNdxNs  to  L1  (£2) 

Hi  j  (x)  The  kernel  in  Aa  at  x  for  the  /th  detector  and  the  jth  source 

H*j  (x)  The  kernel  in  A*  at  x  for  the  /th  detector  and  the  jth  source 

y(x)  A*T  at  x 

A.  The  regularization  parameter 

ax  (x)  Solution  of  the  regularized  inverse  problem  at  x 

ax  (x)  Solution  of  the  discretized  regularized  inverse  problem 

with  exact  kernel  at  x 

ax  (x)  Solution  of  the  regularized  inverse  problem 

with  degenerate  kernel  at  x 

ax  (x)  Solution  of  the  discretized  regularized  inverse  problem 

with  degenerate  kernel  at  x 


Table  2.  Definition  of  function  spaces  and  norms. 


Notation  Explanation 


/ 

C(i 2) 
C*(£2) 

LP(Q) 

Dz  f 

wJ 

Hp(Q.) 

Il/llo 

\\f\\p 

ll/lloo 

II/IIlp(S2) 

\\f\\0,m 

WfWp.m 


The  complex  conjugate  of  the  function  / 

Space  of  continuous  complex-valued  functions  on  £2 

Space  of  complex-valued  /.-limes  continuously  differentiable  functions  on  £2 
L°°(£2)  =  [f\ess  supQ  |/(x)|  <  00} 

LP(£2)  =  {f\(fa  \f(x)\p  dx)Vp  <oo},pe  [l,oo) 
zth  weak  derivative  of  / 

Hp(V)  =  I/KEu-kp  ll^/llo)1/2  <  00}.  P  e  [l.  00) 

The  2)  norm  of  / 

The  Hp{i 2)  norm  of  / 

The  L°°{i 2)  norm  of  / 

The  LP(Q)  norm  of  / 

The  L 2  norm  of  /  over  the  mth  finite  element  Qm 
The  Hp  norm  of  /  over  the  mth  finite  element  Qm 


Q*  in  (2.4)  by  Gaussian  functions  with  sufficiently  low  variance,  whose  centres  are  located  at 
xJs  and  xld,  respectively. 
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In  this  work,  we  consider  the  finite-element  approximations  of  the  solutions  of  the  forward 
problem.  Hence,  before  we  discretize  the  forward  problem  (see  section  3.2),  we  consider  the 
variational  formulations  of  (2.1)-(2.2)  and  (2.3)-(2.4)  by  multiplying  (2.1)  by  a  test  function 
(p  e  II1  (Q)  and  integrating  over  £2  [9]: 

i&)\ 


Li 


V</>  •  DVgj  + 


—  /  i&>\ 

<P  {Ha  +  — J  gj 


<PQj 


dx+3“f  ^>d/  =  0’ 

Jan 


(2.5) 


where  the  boundary  integral  term  results  from  the  boundary  condition  (2.2). 

Equivalently,  we  can  express  (2.5)  by  defining  the  sesquilinear  form  hUp .  gj)  [16]: 

1 


b(<p,gj)  :=  A((p,gj)  +  [(p,  — gj)=  (</>,  Qj), 


(2.6) 


where 


A(cp, 


gj)  ■=  [  [v0  •  DVgj  +  (na  +  —')<pgj 
Ji 2  L  V  C  ' 


dx, 


(0,  Qj)  ■■=  f  (pQjdx , 

Jo. 


/  1  \ 

1  [ 

(0- 

2a  J an 

(pgj&L 

Similarly,  the  variational  problem  for  (2.3)-(2.4)  can  be  formulated  by  defining  the  sesquilinear 
form  b*(cp ,  g*): 


b  (<P,  g*)  :=  A{(p,  g*)  +  [<p,  —g* )  =  (0,  —  Q 


1 


2  a 


I 


2a 


(2.7) 


where  in  A(cp,  g*),  co  is  replaced  by  —  co. 

The  sesquilinear  forms  b(cp,  gj),  b*((p,  gf)  are  continuous  and  positive  definite  for 
bounded  D  and  /ia  [16].  As  a  result,  the  variational  problems  (2.6)  and  (2.7)  have  unique 
solutions,  which  follows  from  the  Lax-Milgram  lemma  [9],  The  solutions  gj  and  g*  of  the 
variational  problems  (2.6)  and  (2.7)  belong  to  Hl(Q),  which  results  from  the  //'-boundedness 
of  the  Gaussian  function  that  approximates  the  point  source  Qj  and  the  adjoint  source  Q* 
[16].  Assuming  D,  jia  e  C1  (Q)  and  noting  that  Qj,  Q*  e  II 1  (G):  the  solutions  gj,  g*  satisfy 
gj,  gf  e  Hgoc ( G)  (in  [12,  chapter  6.3,  theorem  2]).  This  last  condition  implies  (in  [12,  chapter 
5.6,  theorem  6]) 


gj,gfeC(C2). 


(2.8) 


2.2.  Inverse  problem 


In  this  work,  we  focus  on  the  estimation  of  the  absorption  coefficient;  therefore,  we  assume 
Z)(x)  is  known  for  all  x  e  G  U  iJQ.  To  address  the  nonlinear  nature  of  the  inverse  DOT 
problem,  we  consider  an  iterative  algorithm  based  on  repetitive  linearization  of  the  inverse 
problem  using  first-order  Born  approximation  [2],  As  a  result,  at  each  linearization  step, 
the  following  linear  integral  equation  relates  the  differential  optical  measurements  to  a  small 
perturbation  a  on  the  absorption  coefficient  /x„ : 


r  = 


-/  st 

■=  \  Hgj 
JSl 


(x)gj(x)a(x)  dx 


(x)o:(x)  dx 


^ —  (AaOt)i  j , 


(2.9) 


(2.10) 
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where  H, j  =  — g*gj  is  the  kernel  in  the  (i,  / ) t h  entry  of  the  matrix-valued  operator 
Aa  :  L°°(G)  — »  CNdxNs,gj  is  the  solution  of  (2.6),  g*  is  the  solution  of  (2.7),  and  r,j 
is  the  (i.  j)th  entry  in  the  vector  F  e  C Nd x  Ns ,  which  represents  the  differential  measurement 
at  the  / th  detector  due  to  the  yth  source.  Note  that  approximating  Q*  in  (2.4)  by  a  Gaussian 
function  centred  at  x‘d  implies  that  F;/-  corresponds  to  the  scattered  optical  field  evaluated 
at  xld,  after  filtering  it  by  that  Gaussian  function.  Thus,  the  Gaussian  approximation  of  the 
adjoint  source  models  the  finite  size  of  the  detectors.  Similarly,  approximating  Qj  in  (2.1)  by 
a  Gaussian  function  models  the  finite  beam  of  the  source. 

The  linear  operator  Aa  '■  L°°(Q)  — >  CNdXN‘  defined  by  (2.9)  is  compact  and  bounded  by 
(see  appendices  A  and  B) 

IIAIIl=°(£2)^/i  <  Ad^maxllgfUomaxllgyllo.  (2.11) 

i  j 

For  the  given  solution  space  L°°(G)  fora,  the  compactness  of  the  linear  operator  Aa  implies  the 
illposedness  of  (2.9).  Hence,  we  regularize  (2.9)  with  a  zeroth-order  Tikhonov  regularization. 
This  yields  the  following  equation  which  defines  our  inverse  problem  at  each  linearization 
step: 


Y  :=  A*T  =  (A*aAa  +  Xl)ak 
:=  JCax, 


(2.12) 

(2.13) 


where  X  >  0  and  a'  is  an  approximation  to  a.  In  this  representation,  I  is  the  identity  operator 
and  A*  :  CNdxNs  — >  Ll  (£2)  is  the  adjoint  of  Aa,  defined  by 


Nd,Ns  Nd,Ns 

e a*j)(x )  =  HujWfrj  =  E  (2.14) 

ij  iJ 

for  all  p  e  where  H*-  :=  -g*  gj  is  the  ( i ,  / ) t h  kernel  in  the  adjoint  operator  A*a.  Let 

A  :=  A^Aa,  then  A  :  L°°(f2)  — >  Z,'(r2)  is  defined  as  follows: 


Nd,N, 


(Aa)(x)  = 


=  E  HtjW  [  M.;(x)a(x)  dx 
ij  Jn 

:=  I  k(x,  x)a(x)  dx, 

Jn 

where  k  (x,  x)  stands  for  the  kernel  of  the  integral  operator  A  and  is  given  by 


(2.15) 


Nd,Ns 

k(x,x)=  E  H*j(x)HiJ(x). 
ij 


(2.16) 


Having  defined  the  adjoint  operator  A*,  we  note  that  the  operator  A  :  LX(Q)  —>  Ll(Q) 
is  compact  and  that  the  operator  K,  :  L°°(f2)  — >■  L 1  (G)  is  bounded  by  ||/C||  ^  II  Aa  || 2  +  A.  We 
assume  that  the  solution  ak  e  L°°(Q)  also  satisfies  a'-  e  //'  (G).  For  the  rest  of  the  paper,  we 
will  denote  L°°(G)  and  L 1  (Q)  by  A  and  Y,  respectively. 


3.  Discretization  of  the  inverse  and  forward  problems 

In  this  section,  we  outline  the  discretization  of  the  inverse  and  forward  problems. 
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3.1 .  Inverse  problem  discretization 

In  practice,  we  seek  a  finite-dimensional  approximation  to  the  solution  of  the  inverse  problem 
(2.13)  at  each  linearization  step.  Therefore,  we  discretize  (2.13)  by  projecting  it  onto  a 
finite-dimensional  subspace. 

Let  Xn  C  X  and  Y„  c  Y  denote  a  sequence  of  finite-dimensional  subspaces  of 
dimension  n  =  1,2,...,  spanned  by  first-order  Lagrange  basis  functions  \L\,  ....  Ln).  and 
{Xp} ,  p  =  \ n,  be  the  set  of  collocation  points  on  Q.  Then,  the  collocation  method 
approximates  the  solution  of  (2.13)  by  an  element  a'n  e  X„  which  satisfies 

(/Ca^)(xp)  =  y(xp),  p  =  l,...,n,  (3.1) 

where  we  express  on  a  set  {£2m}  of  finite  elements,  m  =  1, ,  N&  such  that  (J^A  Q.m  = 
as  follows: 

n 

<(x)  =  ^atL*(x).  (3.2) 

k=  1 

Note  that  in  (3.2),  ap  =  a'n(\p).  p  =  1 Then,  (3.1)  can  explicitly  be  written  as 

n 

Attp  +  VV  /  K(Xp,x)Lk(x)dx=y(xp),  p  =  l,...,n.  (3.3) 

t=t 

Equivalently,  the  collocation  method  can  be  interpreted  as  a  projection  with  the  interpolation 
operator  V„  :  Y  -*  Yn  defined  by  [18] 

n 

VnfOO  ■=  X  /(xp)Mx)’  X  e  (3.4) 

P=  1 

for  all  /  e  Y.  Then,  (3.1)  is  equivalent  to 

VnXa'n  =  V„y.  (3.5) 


3.2.  Forward  problem  discretization 

In  this  section,  we  consider  the  finite-element  discretizations  of  (2.6)  and  (2.7),  and  use 
their  solutions  to  approximate  ]  and  H*. .  As  a  result,  we  obtain  finite-dimensional 
approximations  to  K.  and  y . 

Let  Lk  denote  the  kth  first-order  Lagrange  basis  function.  Replacing  <j)  and  gj  in  (2.6) 
with  their  finite-dimensional  counterparts  <l>(x)  =  Ylk=\  PkLk(x),  Gj(x)  —  J]^:1c^Lj(x), 
and  replacing  0  and  g*  in  (2.7)  with  $(x)  =  Y!kU  PkLk( x),  G*(x)  =  J2kU  dkLk(x)  yields 
the  matrix  equations: 

Sc;  =  q, ,  (3.6) 

S*d;  =  q*,  (3.7) 

fore  j  :=  [ci,  Ci, ... ,  cnjY  andd,  :=  [d\ .  t/2 , . . . ,  dNi  ] 7 .  HereSandS*  are  the  finite-element 
matrices  and  q,  and  q*  are  the  load  vectors  resulting  from  the  finite-element  discretization 
of  (2.6)  and  (2.7).  Note  that  for  each  source  (detector),  the  dimension  of  the  finite-element 
solution  Gj  (G*)  can  be  different;  therefore,  /V;  (A,)  may  vary. 

The  Hl(Q)  boundedness  of  the  solutions  gj  and  g*  implies  that  the  discretization  errors 
ej  and  e*  in  G  ;  and  G*  are  bounded.  Let  { Qj„ }  denote  the  set  of  linear  elements  used  to 
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discretize  (2.6)  for  in  =  l, ,  NJA,  such  that  (J;^A  =  £2  for  all  j  =  1, . . . ,  Ns.  Similarly, 

let  {£2J,}  denote  the  set  of  linear  elements  used  to  discretize  (2.7)  for  n  =  1, . . . ,  /V*' ,  such 

that  U,^A  for  al  I  i  =  \ .....  N,j.  Then,  a  bound  for  e.j  and  e*  on  each  finite  element 

can  be  found  by  using  the  discretization  error  estimates  (in  [9,  theorem  4.4.4]): 

ll^yllo.mt  ^  C  11^7  II  l.mihm’  (3-8) 

ll<lloy  C || g* || (3.9) 

where  C  is  a  positive  constant,  IHIov  (IMIo.bO  and  IHliV  (II •  II 1,^ )  are  respectively  the  L2 
and  Hl  norms  on  £2/„  ( Q'n ) ,  and  hJm  (h‘n)  is  the  diameter  of  the  smallest  ball  containing  the 
finite  element  £2Jm  (Qjj  in  the  solution  Gj  (G*). 


3.3.  Discretization  of  the  inverse  problem  with  operator  approximations 

Substituting  the  finite-element  approximations  Gj  and  G*  in  (2.15)  and  (2.14),  and  using  the 
resulting  finite-dimensional  operator  approximations  in  (3.5),  we  obtain  the  following  linear 
system  in  terms  of  a'n  which  approximates  a*: 

Vnta'n  =  V„y.  (3.10) 

In  (3.10),  the  operator  K.  :  X  ->  Y  is  the  finite-dimensional  approximation  of  K.  in  (2.13)  and 
VniC  :  Xn  — >■  Yn.  Similarly, 

y:=A*a  r,  (3.11) 

where  A*a  is  the  approximation  to  the  adjoint  operator  A*,  obtained  by  substituting  Gj  and  G* 
in  (2.14). 

4.  Discretization-based  error  analysis 

As  a  result  of  the  discretization  of  the  forward  and  inverse  problems,  the  reconstructed  image 
a*  in  (3.10)  is  an  approximation  to  the  actual  image  ak.  Thus,  the  accuracy  of  the  reconstructed 
image  depends  on  the  error  incurred  by  the  discretization  of  the  forward  and  inverse  problems. 

In  this  section,  we  analyse  the  effect  of  the  discretization  of  the  forward  and  inverse 
problems  on  the  accuracy  of  DOT  imaging.  The  analysis  is  carried  out  based  on  the  inverse 
problem  at  each  linearization  defined  by  (2.13)  and  the  associated  kernel  k(x,  x). 

In  this  work,  we  follow  an  approach  which  allows  us  to  separately  analyse  the  effect  of 
the  discretization  of  each  problem  on  the  accuracy  of  the  reconstructed  optical  image.  In 
this  respect,  we  first  consider  the  impact  of  projection  (i.e.  inverse  problem  discretization) 
by  the  collocation  method  when  the  associated  kernel  at(x,  x)  in  (2.13)  is  exact.  Next,  we 
explore  the  case  in  which  the  kernel  is  replaced  by  its  finite-dimensional  approximation  (i.e. 
degenerate  kernel)  and  analyse  the  effect  of  the  forward  problem  discretization  on  the  accuracy 
of  the  reconstructed  image  without  projecting  (2.13). 

Our  analysis  reveals  that  even  if  the  kernel  is  exact,  the  accuracy  of  the  solution 
approximation  a'n  in  (3.5)  resulting  from  the  inverse  problem  discretization  depends  on  the 
kernel  k  (x,  x)  of  the  integral  operator.  Likewise,  the  error  in  the  reconstructed  optical  image 
due  to  the  discretization  of  the  forward  problem  is  a  function  of  the  inverse  problem  solution. 
These  results  suggest  that  the  discretization  of  the  inverse  and  forward  problems  cannot  be 
considered  independent  of  each  other. 
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4.1 .  Case  1.  The  kernel  k(x,  x)  is  exact 

In  this  section,  we  show  the  effect  of  projection  on  the  optical  imaging  accuracy.  In  the 
analysis,  we  assume  that  the  kernel  k  (x,  x)  is  exact.  We  first  prove  the  convergence  of  the 
projection  method  for  the  operator  /C,  and  then  analyse  the  effect  of  projection  on  the  imaging 
accuracy. 

Clearly,  the  inverse  operator  1C~ 1  :  Y  — >  X  exists  since  K.  is  positive  definite  for  X  >  0. 
Furthermore,  by  the  compactness  of  A  and  Riesz  theorem,  the  inverse  operator  Kr 1  is  bounded 
by 

II  (4.1) 

Lemma.  Projection  by  the  collocation  method  for  the  operator  JC  :  X  — »■  Y  converges. 
Specifically,  the  sequence  of  finite-dimensional  operators  V„K.  :  X„  — >  Y„  is  invertible  for 
sufficiently  large  n,  and  ( VnK,ylVnlCax  — »■  ax,  n  oo.  Furthermore, 

\\CPnicylvn)c\\x^xn  <  cj^r  (4-2) 

A 

for  some  Cm  >  0  independent  of  n. 

Proof.  See  appendix  C.  □ 

Based  on  the  lemma,  the  following  theorem  provides  an  upper  bound  for  the  L 1  (Q)  norm  of 
the  error  between  the  solution  ax  of  (2.13)  and  the  solution  ax  of  (3.5). 

Theorem  1.  Let  {£2„,}  denote  a  set  of  linear  finite  elements  used  in  the  discretization  of  the 
inverse  problem  (2.13)  for  m  =  1,  . . . ,  N\,  such  that  U(^A  Qm  —  £2,  and  hm  be  the  diameter 
of  the  smallest  ball  that  contains  the  with  element.  Then, 

Na 

fa1  —  oix  ||Li(Q)  ^  —  7nllr->-x„  l|aA||i,m^m 

m= 1 

+  -||^.lly^x„max||g*gJ||Li(Q)^  ^  llg*g/llo,JI«A'lli,m^m,  (4.3) 

m=  1  i,j 

where  C  is  a  positive  constant,  Vq  is  the  volume  of  £2  and  Tn  :  Y  —>  Xn  is  a  uniformly  bounded 
operator  given  byTn  —  (l+  yVnA)  'v„. 

Proof. 

ax  -  ax  =  [1  -  (VnIQ~lVnIC]ax 

=  11-  (VnlQ-'VnK.](aA  -  fr)  (4.4) 

since  [1  —  ('P„K,)~l'PnlC]fr  =  0,  where  i fr  e  Xn  is  the  interpolant  of  ax  [9].  Using  (C.2), 

[1  -  c vnxyxvnx }  =  i-(i+  jVka)  ]-vnK. 

=  1-Tn\ic,  (4.5) 

A 

where  Tn  :=  (l  +  yVnA)  lV„  is  a  uniformly  bounded  operator  (see  appendix  C).  We  use  1C. 
defined  by  (2.13)  and  (4.5)  in  (4.4)  to  obtain 

(/  -  ax  =  (1-  Tn)(ux  -fi)-  ^A(ax  -  fi). 

A 


(4.6) 
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Then,  we  use  the  definition  of  A  in  (4.6)  and  find 

ak  -ak  =  (I  -  %)(ak  -  f)  -  ^  [  *(•,  x)(ak  -  \j/){x)  dx. 

^  JQ 


This  leads  to 


(4.7) 


II  —  UnWLHn)  ^  l|2”  —  Tn\\Y^Xn  ll«X  —  llz,1(£j) 


+  -  \\Tn\\Y^X, 
A 


„  II  f  K{’’ 


x)(or  —  i fr)(x)  dx 


L'(S2) 


^  l|T—  T„\\y^x„\\uX  —  i/r II z,1(S2) 


1 


+  t  \\%  ||r->-x„ 

*■  -is 2  Jn 


xn  /  dx  /  |#c(x,  x 

Jn  Ji 2 


x)(ax  —  i/0(x)|  dx,  (4.8) 


The  second  term  in  (4.8)  can  be  rewritten  as 


*-f  dx  / 

JQ 


1 

T  \\%WY^Xn 
^  JQ  JQ. 


/c(x,  x)  (or  —  t/t)(x)|  dx 


=  -\\%\\y^x, 


./4e/ 

Ja  \m=1  Jn„ 


|/c(x,  x)(or  —  t/f)(x)|  dx 


Let  ea  be  the  interpolation  error: 

ea  ax  —  \[r. 
Then,  using  (2.16), 


NA  r 

zL 

m—  1  " 


Na 


r(x,  x)e„(x)|  dx  =  JZ  I 

m=\ 


Nd,Ns 


S* (x) 8 j (x) 8* (x) g j (x)  ea  (x) 


‘ j 


dx 


Na  Nd,Ns 


;>  >  '  |£*(x)gy(x)|  f  I g* (x)g j (x) 1 1 ea (x) |  dx 

..  ,  :  ,  h 2„, 


m=  1  i,j 
Na  Nd,Ns 

l^*(x)gy(x)|||g*gy||o.ml|ecr||o,ffl, 

m=  1  i,j 


(4.9) 


(4.10) 


(4.11) 


(4.12) 


where  (4.12)  follows  from  the  Schwarz’  inequality.  Note  that  gfgj  e  L2(Q)  by  considering 
(2.8)  holds  up  to  the  boundary  3£2  (see  [11,  theorem  2.1]). 

We  now  use  (4.9)  and  (4.12)  to  obtain 


\\\Tn\\Y^xn  [  dx(  f  |/c(x,  x)(ak  -  \J/)(x)\dx 
Jo.  \Jq  / 

^  »  Na  Nd,Ns 

~\\%\\y^x„  /  dxV  v  | g* (x)g j (x) 1 1| || 0im || ea ||  o,m  • 
A  ...  , 


(4.13) 


m=l  ij 


Using  the  bound  (4.13)  in  (4.8)  and  substituting  the  interpolation  error  bound  [9] 
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and  noting  ||eJLi(n)  ^  - 

II®*  —  ^  C^/v 


m= 1 


+  — 


c 


iVa  Nd,N, 

Tn\\Y^Xn  EE  WgtgjWm^WgtgjhmW^h,^. 


m= 1 


r 


Na  Nd,Ns 


+  - 


m= 1  i,j 


m  • 


(4.15) 

□ 


Remark  1. 


(i)  Theorem  1  shows  the  spatial  dependence  of  the  inverse  problem  discretization  on  the 
forward  problem  solution. 

(ii)  The  first  term  in  (4.15)  suggests  that  the  mesh  of  the  inverse  problem  be  refined  where 
|| ax  ||i  is  large. 

(iii)  The  second  term  in  (4.15)  shows  that  the  term  ||aA||iim  is  scaled  spatially  by  \\g*gj  ||o,m- 
Thus,  the  effect  of  the  interpolation  error  ea  (see  equation  (4.10))  in  the  inverse  problem 
solution  is  scaled  spatially  by  the  solution  of  the  forward  problem.  As  a  result,  the 
orientation  of  the  sources  and  detectors  with  respect  to  the  support  of  the  optical 
heterogeneity  determines  the  extent  of  the  bound  on  ||  of'*"  —  ax  ||Li(S2). 

(iv)  The  regularization  parameter  affects  the  bound  on  || —  a^|Li(S2). 

(v)  Increasing  the  number  of  sources  and  detectors  increases  the  bound  on  ||ax  —  ax  ||  Lt(Q)- 


Remark  2. 


(i)  Note  that  the  conventional  interpolation  error  estimate  given  in  (4.14)  depends  only  on 
the  smoothness  and  support  of  ax,  and  the  finite-dimensional  space  of  approximating 
functions  [9].  On  the  other  hand,  the  error  estimate  (4.3)  in  theorem  1  shows  that  the 
accuracy  of  the  reconstructed  image  ax  depends  on  the  orientation  of  the  absorptive 
heterogeneity  with  respect  to  the  sources  and  detectors,  as  well  as  on  the  bound  (4.14)  on 
the  interpolation  error. 

(ii)  An  error  bound  similar  to  (4.3)  follows  if  one  uses  the  Galerkin  method  [18]  instead  of 
the  collocation  method  for  projection. 

(iii)  The  interpolation  error  bound  (4.14)  can  be  modified  based  on  the  choice  of  the  basis 
function  in  (3.2)  and  the  smoothness  of  the  solution  ax  (theorem  4.4.4.  in  [9]).  For 
instance,  if  ax  e  fl2(Q)  and  quadratic  Lagrange  basis  functions  are  used,  then  (4.14)  can 
be  replaced  by 


^ a  Wo, in 


for  some  C  >  0. 

(iv)  An  error  bound  similar  to  (4.3)  can  be  derived  for  the  error  that  occurs  as  a  result  of 
the  discretization  of  the  inverse  problem  in  electrical  impedance  tomography,  optical 
fluorescence  tomography,  bioluminescence  tomography  and  microwave  imaging.  Note 
that  in  all  these  imaging  modalities,  the  forward  problem  is  modelled  by  a  PDE  and  the 
inverse  problem  can  be  interpreted  in  terms  of  a  linear  integral  equation,  whose  kernel  is 
related  to  the  solution  of  this  PDE. 
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(v)  Let  ys  be  the  perturbed  right-hand  side  y  of  (3.5)  due  to  the  presence  of  noise,  such  that 
||  ys  —  y  ||z,i(£2)  ^  5.  Then,  an  additional  term  is  introduced  to  the  error  bound  in  (4.3)  due 
to  this  perturbation: 

na 

|  ~  “nil  £,'(«)  ^  Cy/Va^T  —  Tn\\Y^x„  ^  llo^lli.m^m 

m=  1 

q  Na  Nd,Ns 

+  T||r„||^x„max||^||i,(Q)V  V  ||g*g;||o,ml|aA||i,m/im 

A  ‘J  i  .  . 

m=l  i.j 

Cm 

+  -^-<5,  (4.16) 

where  Cm  >  0  is  the  constant  in  (4.2)  with  the  use  of  first-order  Lagrange  basis  functions 
(see  appendix  C).  The  additional  term  Cm  S/A.  indicates  that  the  choice  of  basis  functions 
may  be  critical  in  the  presence  of  noise. 


4.2.  Case  2.  The  kernel  is  degenerate 

In  this  section,  we  first  derive  approximate  upper  bounds  for  the  approximation  errors  ||  K.  —  K.  || 
and  ||  y  —  y  || ,  which  result  from  the  discretization  of  the  forward  problem.  Then,  we  show  the 
effect  of  these  approximation  errors  on  the  accuracy  of  the  reconstructed  optical  image.  For 
notational  convenience,  we  will  drop  the  subscripts  on  the  norms  ||-||  where  necessary. 

The  operator  K,  :  X  —>  Y  is  bounded  with  a  bounded  inverse  /C  1  :  Y  ->  X.  By  the 
finite-element  approximation  of  the  associated  kernel,  the  sequence  of  bounded  linear  finite¬ 
dimensional  operators  JC  is  norm  convergent  \\K  —  /C||  — »■  0;  Nj,  N,  —>  oo,  for  j  =  1, . . . ,  Ns 
and  i  =  1, . . . ,  Nd,  and 

\\iC-l\\Y^x  <  1A,  (4.17) 


which  can  be  obtained  analogous  to  (4.1). 

In  the  following,  we  derive  an  explicit  approximation  to  the  error  ||/C  —  K.\\  in  terms  of  the 
associated  kernel  and  the  discretization  error  in  the  kernel  approximation.  The  result  is  then 
used  to  compute  the  error  in  the  reconstructed  optical  image  due  to  \\1C  —  K.\\. 

By  definition, 


Nd.Ns 

II  (A  -  A)a||;i  = 

hj 

where  G*,  G  j  are  the  finite-element  approximations  to  g*  and  gj ,  respectively.  We  can  expand 
ggj-GfGj  as 


/ 

JSi 


(g*(x)gj(x)  -  G*(x)Gj(x))a(x)  dx 


(4.18) 


gfgj  -  G*Gj  =  e*ej  +  G,e*  +  G*en  (4.19) 

where  e*  :=  g*  —  G*  and  ej  gj  —  Gj.  Replacing  G*  and  G j  respectively  with  g*  —  e*  and 
gj  ~  ej,  we  get 

gfgj -GfGj  =  gj^  +  tfej -tfej 

« gjef  +  gfej >  (4-20) 

where  we  neglect  the  term  e*  ej . 

We  can  express  K  —  K.  as 

1C  -  K.  =  A*aAa  -  A* A 


(4.21) 
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Following  a  similar  approach  as  above, 

A*aAa  -  A*aAa  =  (A*  -  A*a)(Aa  -  Aa)  +  A*a(Aa  -  Aa)  +  (A*a  -  A*a)Aa.  (4.22) 


As  a  result,  the  following  condition  holds 

II ft  -  K.W  <  ||(A  -  AHA  -  A) II  +  II A(A  -  Aa)  +  (A  -  A)AII-  (4.23) 

Since  Aa  —  —(A  —  A)  +  A.  (4-23)  can  be  rewritten  as 


\\ic-m  =  II  AA-  A  A II 

^  II (A  -  A) (A  -  A)ll  +2|iA(A  -  A) II 
«2ha(A-  A)  II, 

where  we  neglect  the  term  ||(A  —  AHA  —  A)ll- 
Similarly,  ||y  —  y  ||  can  be  interpreted  as 


r 

Nd,Ns 

Y  —  yWlHq.)  =  / 

E  A*(x)gXx)  -  c*(x)G,(x))r 

J 

Jn 

ij 

r 

Nd.Ns 

-  / 

E  WSJ  (x)  +  g*i  (x)ey  (x)) T ij 

ds 

Jn 

Uj 

(4.24) 


(4.25) 


where  the  error  in  r,  j  due  to  discretization  is  neglected  and  the  last  approximation  is  derived 
similar  to  (4.20). 

We  now  analyse  the  effect  of  the  forward  problem  discretization  on  the  accuracy  of  the 
reconstructed  optical  image.  Let  a'  be  the  solution  of 

/CA  =  y,  (4.26) 


where  K.  and  y  are  the  finite-dimensional  approximations  to  K,  and  y,  respectively.  Then, 
by  theorem  10.1  in  [18],  the  error  in  the  solution  ax  with  respect  to  the  actual  solution  ak  is 
bounded  by 

II A  -A|K  UwOC  -  AC) A ||  +  || y  -  y||}.  (4.27) 

A 

In  the  next  theorem,  we  will  expand  the  terms  in  (4.27)  to  show  explicitly  the  effect  of 
the  forward  problem  discretization  on  the  accuracy  of  the  inverse  problem  solution. 


Theorem  2.  Let  {A,}  denote  the  set  of  linear  elements  used  to  discretize  (2.6)  for 
m  =  1, . . . ,  Nja,  such  that  [_J^A  A,  =  £2  and  hi,  be  the  diameter  of  the  smallest  ball  that 
contains  the  element  An  in  the  solution  Gj ,  for  all  j  =  1, ...  ,NS.  Similarly,  let  { A, }  denote 

the  set  of  linear  elements  used  to  discretize  (2.7)  for  n  =  1, . . . ,  NA,  such  that  U(^A  Qln  —  £2 
and  h'n  be  the  diameter  of  the  smallest  ball  that  contains  the  element  A  in  the  solution  G*, 
for  all  i  =  1,  . . . ,  Nd-  Then,  a  bound  for  the  error  between  the  solution  ak  of  (2.13)  and  the 
solution  ax  of  (4.26)  due  to  the  approximations  K.  and  y  is  given  by 

c  /  Nd  N«,N, 

l|A  -  A||li(J2)  <  -max||g*g7||Li(Q)  |E  E  (^IgyAlloy  +  II  allodia  Hoy)  IlgflltyA 

V=i  n,j 

Ns  N{,Nd  \ 

+  E  E  (2||g*A||0,m;  +  Moo\\g*\\0,m])\\gjh,„chJm  ,  (4.28) 

j= 1  »u  / 

where  C  is  a  positive  constant. 
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Proof.  Using  (4.24),  (4.18)  and  (4.20),  we  can  write 

II (£  -  K)ax||Li(0)  «  2 \\A*a{Aa  -  A,)aA||L.(0) 


Nd,N, 

Y  g*(-)gj( ')  /  (gy(x)e*(x)  +  gf(x)ej(x))ax(x)  dx 
ij  Jn 


Nd,Ns  „ 

<  2max||g*g;||Li(£2)  Y  1  l(£./(x)e*(x)  +  g*(x)e; (x))ok(x)|  dx.  (4.29) 

u  •/£-’ 


An  upper  bound  for  the  integral  in  (4.29)  can  be  obtained  as  follows: 

f  |  (gj(x)e*(x)  +  gt(x)ej(x))ax(x)  |  dx 
Jn 


NV 


Ni 


^  ^  '  II  ej  llo.n'  II  g  ja  ||W+El^ll«Wllft“  llo.mJ- 

n= 1  m  =  1 


(4.30) 


Note  that  g7aA  e  Z/(£2)  since  |g7aA|  ^  IgylHonloo-  Similarly,  g*ak  e  L~(£2)  since 
|gf«A|  ^  Igflll^Hoo.  Using  (4.30)  in  (4.29), 

||(£-K)ax||Li(0)  ^2max||g;gy-||ii(n) 

(Nd  Nf,Ns  N s  N{,Nd 

E  E  iieriio,«-ii^aAiioy  +  E  E  n^iio.mtii^r^iio.mt 

i  =  l  n,j  j=  1  m,i 


(4.31) 


To  compute  an  upper  bound  for  ||y  —  y  ||  using  (4.25),  we  first  write 


/ 

Jn 


dx 


Nd,Ns 

Y  (ef^gJOO  +  gfixjeJOOWi, 

<,j 

r  Nd,Ns  _  _ 

<  max|r,  ;|  /  V  \e*  (x)g j  (x)  +  g*  (x)ej  (x)  |  dx 

Jn  jy 

/  Nd  N«,N,  N,  N{,Nd 

^  max|r,j|  EE  lk*lloylkylloy  +  EE  H^fllo.mJ  \\cj  \\0,mJ 

\i=l  n,j  i=j  m,i 


Noting  (2.9), 


max|r;,7|  <  max||g*g7- 11^1(52) Halloo, 

l,J  l,J 


(4.32) 


(4.33) 


which  leads  to 

(Nd  N%,NS  Ns  N{,Nd 

E  E  lk*lloylkylloy  +  E  E  Iloilo, mj Ik; Nov 

>'=  1  « ,j  i=j  m,i 

/  Nd  N«,NS  JVS  Ni,Nd 

<  tnax||  g*gy  II  z.1  (fi)  II « II  oo  (EE  Ik*  Hoy  Iky  Hoy  +  EE  lk*llovlkyllov  |  • 


\i= 1  n,j 


i=J  m,i 


(4.34) 
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We  now  use  (4.31),  (4.34),  the  corresponding  discretization  error  estimates  (3.8)-(3.9),  and 


□ 


(4.27)  to  obtain  (4.28). 

Remark  3. 


(i)  Theorem  2  suggests  the  use  of  meshes  designed  individually  for  the  solutions  Gj ,  j  = 
1  ,...,NS  and  G*,i  =  1  ,...,Nd. 

(ii)  Theorem  2  states  explicitly  the  effect  of  the  forward  problem  discretization  on  the  accuracy 
of  the  inverse  problem  solution.  In  this  context,  theorem  2  suggests  a  discretization  scheme 
for  the  forward  problem,  where  the  discretization  criterion  is  based  on  the  inverse  problem 
solution  accuracy,  rather  than  the  accuracy  of  the  forward  problem  solution. 

(iii)  For  each  source,  when  solving  for  Gj,  hJm  has  to  be  kept  small  where  (2\\  g*  a'-  Wq,,,,  + 
IMIoollgf  llo,mj)llgjlli,mj  is  large.  Note  that  ||g7||i,mj  will  be  large  on  the  elements  close 
to  the  /th  source. 

(iv)  For  each  detector,  when  solving  for  G*,  h'n  has  to  be  kept  small  where  (2 1| || o,n'  + 
ll«lloollgjllo,n'')llgI*lli,ni  is  large.  Note  that  ||gf||iy  will  be  large  on  the  elements  close  to 
the  i  th  detector. 

(v)  |  gj  and  \g*\  are  higher  close  to  the  sources  and  detectors,  respectively.  Therefore,  hJm 
has  to  be  small  around  the  /th  source  and  around  all  detectors,  where  ak  is  nonzero. 
Likewise,  h'n  has  to  be  small  around  the  *  th  detector  and  around  all  sources,  where  a'  is 
nonzero. 

(vi)  If  ak  is  nonzero  on  the  whole  domain  Q,  then  the  error  may  become  higher  depending  on 
the  magnitude  of  |  gj  and  \g*\. 

(vii)  The  regularization  parameter  affects  the  bound  on  ||aA  —  aA||ii(Q). 

(viii)  Increasing  the  number  of  sources  and  detectors  increases  the  bound  on  ||aA  — 


Remark  4. 


(i)  Note  that  the  finite-element  discretization  error  estimates  (3.8) — (3.9)  depend  on  only  the 
smoothness  and  support  of  g  j  and  g* ,  and  the  finite  dimensional  space  of  approximating 
functions  [9].  However,  the  error  estimate  (4.28)  in  theorem  2  shows  that  the  accuracy 
of  the  reconstructed  image  a 1  depends  on  the  orientation  of  the  absorptive  heterogeneity 
with  respect  to  the  sources  and  detectors,  as  well  as  on  the  finite-element  discretization 
error  estimates  (3.8) — (3.9).  In  this  respect,  the  estimate  (4.28)  in  theorem  2  shows  that 
reducing  the  discretization  error  in  the  solutions  Gj  and  G*  of  the  forward  problem  may 
not  ensure  the  accuracy  of  the  reconstructed  absorption  image  (see  [14]). 

(ii)  In  case  a  different  discretization  approach  such  as  finite  difference  [20]  or  finite 
volume  [13]  is  used  to  solve  the  forward  problem,  theorem  2  can  be  modified  in  a 
straightforward  manner  by  replacing  the  discretization  error  estimates  (3.8)  and  (3.9) 
with  the  corresponding  error  estimates  specific  to  the  method  of  choice  [13,  20], 

(iii)  Let  ys  be  the  perturbed  right-hand  side  y  of  (4.26)  due  to  the  presence  of  noise,  such  that 
||  j>a  —  y  ||  /j  (Q)  ^  <5.  Then,  an  additional  term  is  introduced  to  the  bound  in  (4.28)  due  to 
this  perturbation: 


L>(Q)  <  Vmax|lg*<?yllii(fi) 

A  i,j 


(4.35) 
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Clearly,  the  additional  term  <5//,  due  to  the  presence  of  noise  in  (4.35)  is  independent  of 
the  discretization  of  the  forward  problem. 

(iv)  Theorem  2  provides  a  general  framework  to  analyse  the  error  in  reconstructed  optical 
images  resulting  from  the  perturbations  in  the  kernel  of  the  linear  integral  equation  (2.16). 
In  general,  a  perturbation  in  the  kernel  of  the  linear  integral  equation  (2.16)  can  occur 
due  to  errors  resulting  from  the  numerical  integration  of  (2.6)-(2.7),  the  approximation  of 
the  boundary  9  £2,  the  inaccurate  approximation  of  the  source  Q  f  and/or  the  background 
optical  properties.  Furthermore,  the  analysis  framework  in  theorem  2  can  be  used  to 
analyse  the  effect  of  linearization  of  the  Lippmann-Schwinger-type  equations  [10]  using 
Born  approximation  on  the  accuracy  of  the  reconstructed  optical  images  [15]. 

(v)  A  bound  similar  to  (4.28)  can  be  derived  for  the  error  that  occurs  as  a  result  of 
the  discretization  of  the  forward  problem  in  electrical  impedance  tomography,  optical 
fluorescence  tomography,  bioluminescence  tomography  and  microwave  imaging. 

4.3.  Iterative  Born  approximation 

In  this  section,  we  explore  the  error  in  the  inverse  problem  solution  within  an  iterative 
linearization  approach. 

The  error  analysis  presented  in  this  paper  covers  the  error  which  results  from  the 
discretization  of  the  forward  and  inverse  problems.  If  a  is  sufficiently  low,  then  one  iteration 
suffices  to  solve  the  inverse  problem  and  the  error  analysis  discussed  above  applies.  When 
iterative  linearization  is  considered  to  address  the  nonlinearity  of  the  inverse  problem,  we  can 
make  use  of  the  error  analysis  at  each  linearized  step  as  follows:  let  and  a'n(t)  be  the 
actual  solution  of  the  regularized  inverse  problem  (2.13)  and  the  solution  of  (3.10)  at  the  fth 
linearization  step,  respectively.  At  the  end  of  the  ( r  —  l  )th  linearization  step,  the  absorption 
coefficient  estimate  at  x  is  given  by  /t^r_1-l(x)  =  /z,®(x)  +  Y^t=i  “«(/)(x)>  where  &n(t)  has  an 
error  due  to  discretization  with  respect  to  the  actual  solution  a^,  and  /a®  is  the  initial  guess  for 
the  background  absorption  coefficient.  In  the  next  linearization,  an  error  on  the  new  solution 
update  f4r)  will  be  introduced  due  to 

(i)  projection  (inverse  problem  discretization), 

(ii)  the  error  (1C  —  /C)(r-1)  in  the  operator  (/C)(r-1)  and  the  error  (y  —  y)(r~1]  in  (y)<r-1) 
resulting  from  the  forward  problem  discretization,  and 

(iii)  the  error  in  the  (r  —  l)th  update  Air-1\  resulting  from  the  discretization  of  the  forward 
and  inverse  problems.  Note  that  Aa -1>  appears  as  a  coefficient  in  the  boundary  value 
problems  (2.1)-(2.2)  and  (2.3)-(2.4).  An  error  in  this  coefficient  implies  perturbation  in 
the  solutions  of  (2.1)-(2.2)  and  (2.3)-(2.4).  As  a  result,  Gj  and  G*  will  have  error  terms 
in  addition  to  the  discretization  error. 

As  a  result,  the  error  in  Ai' '  at  the  rth  iteration  is  bounded  by 


r 


r 


(4.36) 


t=  1 


t=l 


assuming  that  the  initial  guess  /a®  for  the  background  absorption  is  approximated  accurately 
while  solving  the  boundary  value  problems  (2.1)-(2.2)  and  (2.3) — (2.4)  at  the  first  iteration, 
that  is  /a®  (x)  -  J2k=i  Ma0)(xi)^t:(x)  -*  0,  for  all  x  e  £2. 
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5.  Conclusion 

In  this  work,  we  presented  an  error  analysis  to  show  the  relationship  between  the  error  in 
the  reconstructed  optical  absorption  images  and  the  discretization  of  the  forward  and  inverse 
problems.  We  summarized  the  implications  of  the  error  analysis  in  two  theorems  which  provide 
an  insight  into  the  impact  of  forward  and  inverse  problem  discretizations  on  the  accuracy  of 
the  reconstructed  optical  absorption  images.  These  theorems  show  that  the  error  in  the 
reconstructed  optical  image  due  to  the  discretization  of  each  problem  is  bounded  by  roughly 
the  multiplication  of  the  discretization  error  in  the  corresponding  solution  and  the  solution  of 
the  other  problem.  In  particular,  theorem  2  shows  that  solving  the  diffusion  equation  and  the 
associated  adjoint  problem  accurately  may  not  ensure  small  values  for  || K.  —  K. ||  and  ||  y  —  y 1| , 
which  may  lead  to  large  errors  in  the  reconstructed  optical  images,  depending  on  the  value 
of  the  regularization  parameter.  Similarly,  relatively  large  discretization  error  in  the  solution 
of  the  forward  problem  may  have  relatively  low  impact  on  the  accuracy  of  the  reconstructed 
optical  images,  depending  on  the  source-detector  configuration,  and  orientation  with  respect 
to  the  optical  heterogeneities.  We  have  also  shown  that  the  error  estimates  can  be  extended  to 
include  the  effect  of  noise  on  the  overall  error  in  the  reconstructed  images. 

The  error  analysis  presented  in  this  work  motivates  the  development  of  novel  adaptive 
discretization  schemes  based  on  the  error  estimates  in  theorems  1  and  2.  In  the  sequel  of  this 
work,  we  propose  two  novel  adaptive  discretization  algorithms  for  the  forward  and  inverse 
problems  [14],  and  justify  the  validity  of  theorems  1  and  2. 

The  error  analysis  can  be  extended  to  show  the  effect  of  the  discretization  error  on  the 
accuracy  of  the  simultaneous  reconstruction  of  scattering  and  absorption  coefficients,  which 
will  be  the  focus  of  our  future  work.  Finally,  we  note  that  the  error  analysis  introduced  in 
this  paper  is  not  limited  to  DOT,  and  can  easily  be  adapted  for  similar  inverse  parameter 
estimation  problems  such  as  electrical  impedance  tomography,  bioluminescence  tomography, 
optical  fluorescence  tomography,  microwave  imaging  etc. 
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Appendix  A.  Boundedness  of  Aa 


(A.l) 


We  can  write  the  following  inequality: 


(A.2) 
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Using  Schwarz’  inequality,  we  can  write  an  upper  bound  for  the  summation  as  follows: 

Nd,Ns  Nd,N, 

£  /  I  Hi,j  (x)  I  dx  =  £  Ilsfg,-Il£i(n) 

•J  U 

Nd,N, 

<  £  II  Wig;  llo 

ij 

^  NdNs  max  H^fHo  max  Iloilo,  (A.3) 

i  j 

which  leads  to 

\\Aaay  ^  NjNs  max||g*||0  max||gj  ||o||a||oo- 
*  j 

Therefore  an  upper  bound  for  the  norm  of  Aa  is  given  by 

IIAIIl“(Q)-^  <  ^^max||g*||0max||g;'||0.  (A.4) 

i  j 

The  boundedness  of  gj  and  g*  imply  that  Aa  is  bounded. 

Appendix  B.  Compactness  of  Aa 

Aa  is  bounded  by  (A.4).  Furthermore  Aa  maps  the  infinite-dimensional  subspace  L°°(f2)  to  a 
finite-dimensional  subspace  CN‘,xNs ,  that  is  the  range  R(Aa )  of  Aa  satisfies  R(Aa )  e  <p,NdxN> 
due  to  the  finite  number  of  sources  and  detectors.  As  a  result,  Aa  is  compact  [18].  The  inverse 
problem  is  illposed  as  a  consequence  of  compactness  [18]. 

Appendix  C.  Proof  of  the  lemma 

The  identity  operator!  is  a  bounded  operator  with  bounded  inverse  and  =  1 :  Xn  — »■ 

X„.  Furthermore,  \\'P„  ||  x-*x„  is  bounded  for  first-order  Lagrange  basis  functions  [4, 18].  Thus, 
projection  by  collocation  converges  for  the  identity  operator.  A  is  bounded  and  compact,  and 
K  —  XI  +  A  is  injective,  with  bounded  inverse  given  by  (4.1).  As  a  result,  by  theorem  13.7  in 
[18],  the  projection  method  also  converges  for  K,  =  XI  +  A.  Convergence  of  projection  for  K, 
implies  ( V„JC)~1VnICak  ax ,  n  — >  oo  for  (P„/C)_IP„/C  :  X  ->  Xn  [18], 

It  follows  from  the  proof  of  theorem  13.7  in  [18]  that  ( 1  +  jPnA)  1  :  Y„  — »  X„  exists 
and  is  uniformly  bounded  for  all  sufficiently  large  n.  Then  from  P„t C  =  ).Pn  (l  +  jP„A)  = 
k(l  +  jVnA),  it  follows  that  P„K.  :  Xn  — »■  Yn  is  invertible  for  all  sufficiently  large  n  with  the 
inverse  given  by 


(7VC)-1  =  (l+^-PnA^  J. 

(C.l) 

As  a  result  we  can  write  (' PnK,)~lV„K,  as  follows: 

C PnK.)-lVnK.  =  (l+  \vnA\  ^ P„IC . 

(C.2) 

Thus, 

\\{VnKTlVn1C\\x^xn  <  CMmr 

A 

(C.3) 

where  Cm  >  0  is  independent  of  n,  using  the  facts  that  projection  by  collocation  method 
converges  for  the  identity  operator  and  (X  +  jPnA)  is  uniformly  bounded. 
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Abstract 

In  part  I  (Guven  et  al  2007  Inverse  Problems  23  1115-33),  we  analysed 
the  error  in  the  reconstructed  optical  absorption  images  resulting  from  the 
discretization  of  the  forward  and  inverse  problems.  Our  analysis  led  to  two  new 
error  estimates,  which  present  the  relationship  between  the  optical  absorption 
imaging  accuracy  and  the  discretization  error  in  the  solutions  of  the  forward 
and  inverse  problems.  In  this  work,  based  on  the  analysis  presented  in  part  I, 
we  develop  new  adaptive  discretization  schemes  for  the  forward  and  inverse 
problems  in  order  to  reduce  the  error  in  the  reconstructed  images  resulting 
from  discretization.  The  proposed  discretization  schemes  lead  to  adaptively 
refined  composite  meshes  that  yield  the  desired  level  of  imaging  accuracy  while 
reducing  the  size  of  the  discretized  forward  and  inverse  problems.  We  present 
numerical  experiments  to  validate  the  error  estimates  developed  in  part  I  and 
show  the  improvement  in  the  accuracy  of  the  reconstructed  optical  images  with 
the  new  adaptive  mesh  generation  algorithms. 

(Some  hgures  in  this  article  are  in  colour  only  in  the  electronic  version) 


1.  Introduction 

Numerical  approaches  in  solving  the  forward  and  inverse  problems  in  diffuse  optical 
tomography  (DOT)  pose  a  tradeoff  between  computational  efficiency  and  imaging  accuracy. 
This  tradeoff  is  a  direct  consequence  of  the  discretization  of  the  forward  and  inverse  problems 
[2,  9]  and  the  size  of  the  resulting  discrete  forward  and  inverse  problems.  The  imaging 
accuracy  depends  on  the  discretization  error  in  the  forward  and  inverse  problem  solutions. 
On  the  other  hand,  attempting  to  minimize  the  discretization  error  in  the  solutions  of  both 
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problems  separately  implies  a  significant  increase  in  the  size  of  the  discrete  forward  and  inverse 
problems.  Hence,  it  is  important  to  understand  the  relationship  between  the  discretization  error 
and  the  resulting  error  in  the  solution  of  the  inverse  problem.  Such  a  relationship  can  illuminate 
the  mutual  dependence  of  the  forward  and  inverse  problem  solutions  and  identify  the  factors 
that  control  the  extent  to  which  the  discretization  error  in  the  solutions  of  the  forward  and 
inverse  problems  affects  the  accuracy  of  the  reconstructed  optical  images. 

In  part  I  of  this  two-part  study,  we  presented  an  error  analysis  which  showed  the  effect  of 
discretization  of  the  forward  and  inverse  problems  on  the  accuracy  of  the  reconstructed  optical 
absorption  images  [9].  The  analysis  led  to  two  new  error  estimates  that  took  into  account  the 
interdependence  of  the  forward  and  inverse  problems  (see  section  2).  In  the  second  part  of  our 
work,  based  on  the  error  analysis  presented  in  part  I,  we  develop  new  adaptive  discretization 
schemes  for  the  forward  and  inverse  problems.  The  resulting  locally  refined  meshes  reduce 
the  error  in  the  reconstructed  optical  images  while  keeping  the  size  of  the  discrete  forward  and 
inverse  problems  relatively  small. 

There  has  been  extensive  research  on  adaptive  mesh  generation  for  the  numerical  solution 
of  partial  differential  equations  (see  [9]  for  a  list  of  publications)  and  inverse  parameter 
estimation  problems  to  reduce  the  undesired  effect  of  discretization  error  [4,  14].  In  the 
area  of  DOT,  in  [3]  it  was  numerically  shown  that  approximation  errors  resulting  from  the 
discretization  of  the  forward  problem  can  lead  to  significant  degradation  in  the  quality  of 
the  reconstructed  images.  In  that  work,  the  error  in  the  reconstructed  images  is  minimized 
by  using  an  enhanced  imaging  model  that  treats  this  additional  approximation  error  within 
the  Bayesian  framework.  Alternatively,  several  investigators  have  reported  on  adaptive 
discretization  schemes  for  the  forward  and  inverse  problems  to  address  the  optical  image 
degradation  due  to  discretization.  In  [6]  a  ‘data-driven  zonation’  scheme,  which  can  be 
viewed  as  an  adaptive  discretization  algorithm,  was  proposed  for  fluorescence  imaging  [6]. 
In  [8],  we  presented  a  region-of-interest  (ROI)  imaging  scheme  for  DOT,  which  employed 
a  multi-level  algorithm  on  a  non-uniform  grid.  The  non-uniform  grid  is  designed  so  as  to 
provide  finer  spatial  resolution  for  the  ROI  which  corresponds  to  the  tumour  region  as  indicated 
by  a  priori  anatomical  image.  In  [16]  an  a  priori  non-uniform  mesh  design  which  provides 
high  resolution  at  the  heterogeneities  and  near  boundary  regions  was  proposed.  In  that  work, 
the  mesh  refinement  is  independent  of  the  source-detector  configuration  and  the  location  of 
the  heterogeneities.  In  [7]  a  dual  mesh  strategy  was  proposed,  in  which,  a  relatively  fine 
uniform  mesh  is  considered  for  the  forward  problem  discretization  and  a  coarse  uniform  mesh 
is  generated  for  the  inverse  problem  discretization.  In  the  same  study,  an  adaptive  refinement 
scheme  was  proposed  for  the  inverse  problem  discretization,  but  no  adaptive  refinement  was 
considered  for  the  solution  of  the  forward  problem.  Another  dual  mesh  strategy  which  makes 
use  of  a  priori  ultrasound  information  was  presented  in  [10].  In  that  work,  the  dual  mesh 
is  a  coarse  mesh  for  the  background  tissue  and  a  relatively  fine  mesh  for  the  heterogeneity, 
similar  to  the  approach  in  [8].  In  fluorescence  imaging,  a  dual  adaptive  mesh  strategy  was 
used  to  discretize  the  inverse  problem  and  the  associated  coupled  diffusion  equations,  where 
the  refinement  criterion  is  based  on  a  posteriori  discretization  error  estimates  [12].  Note  that 
in  all  these  studies  [6-8,  10,  12,  16],  the  mesh  refinement  criteria  considered  for  the  inverse 
(forward)  problem  disregard  the  impact  of  the  solution  of  the  forward  (inverse)  problem.  In 
other  words,  the  discretization  of  each  problem  is  considered  independently  of  the  solution  of 
the  other  problem. 

In  this  work,  based  on  the  two  error  bounds  provided  by  the  error  analysis  in  part  I  [9],  we 
introduce  an  adaptive  discretization  scheme  for  the  forward  and  inverse  problems,  respectively. 
We  remark  that  the  mesh  refinement  criterion  for  each  problem  comprises  the  discretization 
error  in  the  corresponding  problem  solution,  scaled  spatially  by  the  solutions  of  both  problems. 
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Table  1.  Definition  of  function  spaces  and  norms. 


Notation  Explanation 


/ 

C(Q) 

Ck(Q.) 

L°°(£2) 

LP(Q) 

Kf 

HP(Q) 

ll/llo 
II /Up 

ll/lloo 

II/IIlp(S2) 


ll/llp,m 


The  complex  conjugate  of  the  function  / 

Space  of  continuous  complex-valued  functions  on  Q 

Space  of  complex-valued  &-times  continuously  differentiable  functions  on  £2 

L°°(Q)  =  {/|esssupn  |/(x)|  <  00} 

LP(Q)  =  {/  |  (fa  |/(x)Fdx)  /]  <  00},  p  e  [1,  00(b)) 
zth  weak  derivative  of  / 

Hp(V)  =  {/  |  (E|Z|<P  h/lCf  <  00).  P  6  [1-  00) 

The  L2(f2)  norm  of  / 

The  HP(Q)  norm  of  / 

The  L°°(£2)  norm  of  / 

The  LP(Q)  norm  of  / 

The  L 2  norm  of  /  over  the  mth  finite  element  Q,„ 

The  Hp  nonn  of  /  over  the  mth  finite  element  Qm 


Thus,  the  proposed  adaptive  mesh  generation  algorithms  address  the  interdependence  between 
the  solutions  of  the  forward  and  inverse  problems  and  take  into  account  the  orientation  of  the 
source-detectors  and  the  absorptive  perturbations.  This  makes  the  adaptive  discretization 
algorithms  introduced  in  this  paper  different  from  the  previous  approaches  [6-8,  10,  12,  16]. 
The  simulation  experiments  validate  the  implications  of  our  error  analysis  and  show  that  the 
proposed  mesh  generation  algorithms  significantly  improve  the  accuracy  of  the  reconstructed 
optical  images  for  a  given  number  of  unknowns  in  the  discrete  forward  and  inverse  problems. 
We  specifically  show  that  using  the  discretization  error  estimates,  which  do  not  take  into 
account  the  interdependence  of  forward  and  inverse  problems  as  a  criterion  for  discretization, 
may  lead  to  severely  degraded  image  reconstructions  (see  simulation  study  3).  We  also 
discuss  the  computational  complexity  of  the  proposed  adaptive  mesh  generation  algorithms 
and  compare  it  to  the  computational  complexity  of  mesh  generation  algorithms  based  on 
the  conventional  discretization  error  estimates.  We  finally  note  that  the  proposed  adaptive 
mesh  generation  algorithms  can  be  adapted  for  similar  inverse  parameter  estimation  problems, 
such  as  electrical  impedance  tomography,  optical  fluorescence  tomography,  bioluminescence 
tomography,  microwave  imaging,  etc. 

The  outline  of  this  paper  is  as  follows:  in  section  2,  we  give  a  brief  overview  of  the  forward 
and  inverse  DOT  problems  and  recall  the  two  theorems  presented  in  part  I  which  summarize 
the  impact  of  discretization  on  the  accuracy  of  the  reconstructed  optical  images.  In  section  3, 
based  on  these  two  theorems,  we  introduce  the  adaptive  mesh  generation  algorithms  for  the 
solution  of  the  forward  and  inverse  problems  and  discuss  their  computational  complexity.  In 
section  4,  we  present  our  experimental  results,  which  is  followed  by  section  5.  The  appendix 
includes  the  solution  of  a  model  problem  used  to  initiate  the  adaptive  mesh  generation. 


2.  Overview 

In  this  section,  we  first  briefly  define  the  forward  and  inverse  problems  in  DOT.  Next,  we 
state  theorems  1  and  2  presented  in  the  first  part  of  this  work  [9]  to  recall  the  effect  of  the 
discretization  of  the  forward  and  inverse  problems  on  the  accuracy  of  optical  absorption  image 
reconstruction.  We  refer  to  table  1  for  the  explanation  of  the  notation  associated  with  functions 
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and  their  norms.  Note  that  calligraphic  letters  are  used  to  denote  the  operators,  e.g.  Aa,T, 
tC  etc. 


2.1.  Forward  and  inverse  problems  in  DOT 

We  consider  the  following  boundary  value  problem  to  model  the  near-infrared  light  propagation 
in  a  bounded  domain  £2  c  R3  with  Lipschitz  boundary  3  £2  [2,  5]: 


-V  •  D(x)Vgj(x)  +  (  p,a(\)  +  —  )gj(x)  =  Qj(x)  xeQ,  (2.1) 


gi(x)  +  2aD(\)^-(x)  =  0  x  e  3£2, 
3  n 


(2.2) 


where  gj  (x)  is  the  photon  density  at  x,  Qj  is  the  point  source  located  at  the  source  position 
xJs,  j  =  1 Ns,  where  Ns  is  the  number  of  sources,  D(x)  is  the  diffusion  coefficient  and 
H„  (x)  is  the  absorption  coefficient  at  x,  i  =  \/— I,  o>  is  the  modulation  frequency  of  the 
source,  c  is  the  speed  of  the  light,  a  =  (l  +  i?)/(l  —  R)  where  R  is  a  parameter  governing 
the  internal  reflection  at  the  boundary  3  £2,  and  3  •  /3 n  denotes  the  directional  derivative  along 
the  unit  normal  vector  on  the  boundary.  The  boundary  value  problem  (2.1)-(2.2)  constitutes 
the  forward  problem  in  DOT  together  with  the  associated  adjoint  problem  [2,  9]: 

-V  ■  D(x)Vg*(x)  +  LMx)  -  ^Jgf(x)  =  0  x  e  £2,  (2.3) 


3  p* 

g*(x)  +  2aD(x)-^-(x)  =  Q*(x)  x  e  3£2, 
on 


(2.4) 


where  Q*  is  the  adjoint  source  located  at  the  detector  position  x'd,  i  =  1 , ...  ,Nd,  where  N,i 
is  the  number  of  detectors.  Note  that  we  approximate  the  point  source  Qj  in  (2.1)  and  the 
adjoint  source  Q*  in  (2.4)  by  Gaussian  functions  with  sufficiently  low  variance,  whose  centres 
are  located  at  xJs  and  x'd,  respectively. 

In  this  work,  we  focus  on  the  estimation  of  the  absorption  coefficient  and  consider  an 
iterative  algorithm  based  on  repetitive  linearization  of  the  inverse  problem  using  first-order 
Born  approximation.  Using  a  zeroth-order  Tikhonov  regularization  to  address  the  illposedness, 
the  inverse  problem  at  each  iteration  reads 


y(x)  :=  C4jr)(x)  =  [(A*aAa  +  Xl)al](x) 
:=  /  /c(x,  x)o;^(x)  dx  +  Aq!^(x) 

Jn 


:=  (/Cor)(x), 


(2.5) 

(2.6) 


where  T  e  CNdXN°  is  the  vector  of  differential  measurements  at  N,i  number  of  detectors  due 
to  Ns  number  of  sources,  as  a  result  of  the  small  perturbation  a  on  the  background  absorption 
coefficient  p.a,  and  a 1  is  the  solution  of  the  regularized  inverse  problem.  In  (2.5),  k(x,  x)  is 
the  kernel  of  the  integral  equation,  given  by  [9] 


Nd,N, 


x(x,  x)=  H*j(x)Hij(x), 


IJ 


(2.7) 
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where  Ht j  :=  — gfgj  is  the  (i,  j) th  kernel  of  the  matrix-valued  operator  Aa  :  L°°(£2)  — > 
CNdxN‘  and  H*j  :=  —g*g]  is  the  (;,  j)th  kernel  of  the  adjoint  operator^!*  :CNdxNs  — >  Ll(Q) 
defined  by 

Nd,Ns  Nd,Ns 

=  J2  Kjtohi  =  E  -gtUigjtoPiJ’  (2.8) 

ij  Uj 

for  all  p  e  CNdXN‘ .  We  note  that  gj  and  g*  in  (2.7)  and  (2.8)  are  the  solutions  of  the  variational 
formulations  of  (2.1)-(2.2)  and  (2.3)-(2.4),  respectively  [9],  Assume  that  D,  An  €  C*(f2). 
Noting  Qj ,  Q*  e  H 1  (Q),  the  solutions  gj,  g*  e  II 1  (  f2)  of  the  variational  formulations  of  the 
boundary  value  problems  (2.1)-(2.2)  and  (2.3)-(2.4)  also  satisfy  [9] 


gj,g*eC(Sl).  (2.9) 

For  the  rest  of  the  paper,  we  will  denote  L°°  (Q)  and  L 1  ( Q)  by  X  and  Y,  respectively. 

Below  we  summarize  the  two  theorems  of  part  I  [9]  and  provide  the  error  estimates 
which  will  be  used  in  the  design  of  adaptive  meshes  for  the  discretization  of  the  forward  and 
inverse  DOT  problems.  In  this  respect,  we  first  consider  the  impact  of  the  inverse  problem 
discretization  when  the  associated  kernel  k(x,  x)  in  (2.5)  is  exact.  Next,  we  give  the  error 
estimate  for  the  case  in  which  the  kernel  is  replaced  by  its  finite-dimensional  approximation 
(i.e.  degenerate  kernel)  and  analyse  the  effect  of  the  forward  problem  discretization  on  the 
accuracy  of  the  reconstructed  image  without  projecting  (2.6). 


2.2.  Effect  of  inverse  problem  discretization 

Let  Xn  C  X  and  Y„  c  Y  denote  a  sequence  of  finite  dimensional  subspaces  of  dimension 
n  =  1,2,...,  spanned  by  first-order  Lagrange  basis  functions  {Li, . . . ,  Ln },  and  {x;)J,  p  = 
1 ,...,«,  be  the  set  of  collocation  points  on  Then,  the  discretization  of  the  inverse  problem 
(2.6)  by  projecting  it  onto  the  finite  dimensional  subspace  Yn  using  the  collocation  method 
approximates  the  solution  of  (2.6)  by  an  element  a'n  e  X„  which  satisfies 

(/CO  (xp)  =  y(xp),  p  =  l,...,n,  (2.10) 

where  we  express  a^(x),  x  e  Q  on  a  set  { Qm }  of  finite  elements  for  m  =  1, . . . ,  N&  such  that 
U^A  Qm  =  Q,  as  follows: 

n 

“n(X)  =  Ea*L*(X)'  (2-H) 

k=  1 

Equivalently,  the  collocation  method  can  be  interpreted  as  a  projection  with  the  interpolation 
operator  V„  :  Y  —>  Yn  defined  by  [13] 

n 

'P n  f  (x)  :=  f(xp)Lp(x),  xe£2,  (2.12) 

p=  i 

for  all  /  e  Y .  Then,  (2.10)  is  equivalent  to 

VnKaxn=Vny.  (2.13) 

Let  be  the  interpolant  of  ax  [5]  and  assume  that  ax  e  // 1  (Q).  Then,  the  interpolation 
error  ea  —  ax  —  \[r  on  each  finite  element  Qm  is  bounded  by 


1140 


M  Guven  et  al 


where  C  is  a  positive  constant  and  hm  is  the  diameter  of  the  smallest  ball  that  contains  the  mth 
element  Qm. 

Theorem  1  describes  the  effect  of  inverse  problem  discretization  on  the  accuracy  of  the 
reconstructed  optical  absorption  image. 

Theorem  1.  Let  gj ,  g*  be  the  solutions  of  the  variational  formulations  of  the  boundary  value 
problems  (2.1)-(2.2)  and  (2.3)-(2.4),  respectively.  The  error  between  the  solution  ak  of  (2.6) 
and  the  solution  of  (2.13)  is  bounded  by 

Na 

|  —  “»  ||/.i(Q)  ^  Cy/Va\\T  —  Tn\\y^Xn  llaAHl -m^m 

m=  1 

C  Na  Nd,Ns 

+  -||7;,||y^Xnmax||g*g7||Li(Q)^  WsUib,m\\oikh.mhm,  (2.15) 

,J  "1=1  i.j 

where  C  is  a  positive  constant,  V, n  is  the  volume  of  (2,  Tn  :  Y  X„  is  a  uniformly  bounded 
operator  given  by  Tn  =  (l  +  {VnA*aAa)~lVn  [9], 

Proof.  See  [9].  □ 

2.3.  Effect  of f oi-ward problem  discretization 

Let  { QJm }  denote  the  set  of  linear  elements  used  to  discretize  the  variational  formulation  of 
the  boundary  value  problem  (2. 1)— (2.2)  for  m  —  1, . . . ,  NJA;  such  that  |J™A  £2™  =  and 
hi,  be  the  diameter  of  the  smallest  ball  that  contains  the  element  Q/„  in  the  finite-dimensional 
solution  Gj,  for  all  j  =  l, ...  ,NS  [9],  Similarly,  let  { Q'n }  denote  the  set  of  linear  elements 
used  to  discretize  the  variational  formulation  of  the  boundary  value  problem  (2.3)-(2.4)  for 
n  =  1,  . . . ,  Nf ;  such  that  [Ja'a  Q'h  =  Q,  and  h'n  be  the  diameter  of  the  smallest  ball  that 
contains  the  element  Q.'n  in  the  finite-dimensional  solution  G*,  for  all  i  =  1 ,  . . . ,  Nd  [9],  Then, 
a  bound  for  the  discretization  error  in  the  finite  element  solutions  Gj  and  G*  with  respect 
to  the  solutions  gj  and  g*  of  the  variational  formulations  of  the  boundary  value  problems 
(2.1)-(2.2)  and  (2.3)— (2.4)  on  each  finite  element  can  be  given  by  [5] 

II g;  -  G,-||0,my  <  C\\gj\\UmihJm,  (2.16) 

\\g*-G*\\o,n^C\\g*\\i,n‘K,  (2.17) 

where  C  is  a  positive  constant,  and  IHIo.mJ  (IHIo,«0  and  IMIi,mr  (INI i,„0  are  respectively  the  L 1 
and  // 1  norms  on  QJm(Q‘n). 

Consider  the  inverse  problem 

jCax  =  y,  (2.18) 

where  K,  and  y  are  the  finite  dimensional  approximations  to  K.  and  y,  obtained  by  substituting 
gj  and  g*  in  //,  ;  and  H*  -  by  Gj  and  G* ,  respectively. 

Theorem  2  shows  the  effect  of  forward  problem  discretization  on  the  accuracy  of  the 
reconstructed  optical  absorption  image. 

Theorem  2.  A  bound  for  the  error  between  the  solution  ak  of  (2.6)  and  the  solution  ax  of 
(2.18)  due  to  approximations  1C  and  y  is  given  by 
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c 


Nd  JVJ.JV, 


-  0(^11  ti(£2)  <  -max||g*gy||Li(n)  V]  V  (2||^ax||0,B<  +  ||Qt||ooll^llo,n')llft*lli,B'^ 

A  i,J  \  . 


i>= 1  «J 

Ns  N{,Nd 

+  J2J2  (2ii«r«Aiio.w  +  ii«iiookriio^)ii^iii,„/^  I , 


(2.19) 


7=1  m.i 

where  C  is  a  positive  constant. 


Proof.  See  [9]. 


□ 


We  refer  to  part  I  [9]  for  further  details  of  the  discussion  regarding  the  definition  and 
discretization  of  the  forward  and  inverse  problems.  In  the  following,  we  discuss  the  adaptive 
mesh  generation  for  the  forward  and  inverse  problems. 


3.  Adaptive  mesh  generation 

In  this  section,  we  discuss  the  adaptive  mesh  design  for  the  discretization  of  the  forward  and 
inverse  problems  based  on  theorems  1  and  2.  For  each  problem,  we  present  an  adaptive  mesh 
generation  algorithm,  which  is  followed  by  the  corresponding  computational  cost  analysis. 


3.1 .  Adaptive  mesh  generation  for  the  forward  problem 

Let  the  mesh  parameter  liJm  for  Gj,j  =  1  ,...,NS,  and  the  mesh  parameter  h'n  for 
G*,i  =  1, . . . ,  Nd  be  chosen  so  that 


hi,  < 


hi  ^ 


_ L/ _ _ 

+  Il«llook*llov)k7lll.^  J  ’ 


"  E7=i(2|I^«AHo,-<  +  INIoollgjIlo.nOllgfllty 


:=  BJ", 


(3.20) 

(3.21) 


where  the  tolerance  c f  will  be  defined  later.  Then,  by  theorem  2,  the  error  in  the  reconstructed 
image  due  to  the  forward  problem  discretization  is  bounded  by 


C 

—  max  Ilgfgy'llL'tfi) 
A  i,j 


Nd 


i=i 


e/  =  ef, 


(3.22) 


where  C  is  a  positive  constant  and  is  the  total  allowable  error  in  the  reconstructed  optical 
image  due  to  the  forward  problem  discretization.  Equation  (3.22)  implies  the  following  value 
for  €/: 


e/ 


Xef/C 


htgjWLHviEjUNL+z^Nzy 


max, 


(3.23) 


Algorithm  1  outlines  the  adaptive  mesh  generation  algorithm  for  the  forward  problem  in 
the  form  of  a  pseudocode.  The  algorithm  is  performed  for  each  source  and  detector  before  the 
linearization  of  the  inverse  problem  and  it  yields  a  family  of  adaptively  refined  meshes  with 
conforming  elements.  We  use  Rivara’s  algorithm  [15]  for  refinement. 
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Algorithm  1.  The  pseudocode  for  the  mesh  generation  algorithm  for  the  forward  problem, 
prior  to  the  linearization  of  the  inverse  problem. 

o  Generate  an  initial  uniform  mesh  (A, A/a),  A  =  U^fA*} 
o  Set  e-f 

o  Initialize  the  set  of  marked  elements:  Me  {} 
o  flag  =  True 
while  flag  =  True 

for  each  element  A,,  e  Awith  mesh  parameter  hj,  (/;',) 
if  first  linearization 

.  Use  analytical  solutions  for  gj  and  g*  and  a  priori  anatomical 
information  about  a  to  compute  the  bound  BJ  in  (3.20)  (B*n  in  (3.21)) 
else 

.  Use  current  solution  updates  Gj  and  G*  and  a'n 
to  compute  BJ  in  (3.20)  (B*n  in  (3.21)) 

end 

if  hi  >  BJ  (h‘e  >  B*n) 

■  Me  <-MfU!Ae} 

end 

end 

if  Me  /  {} 

.  Refine  the  marked  elements  and  update  the  mesh  A 
■  Me  <-  {} 
else 

.  flag  =  False 

end 

end 

o  Solve  for  Gj  (G*). 


Remark  1. 

(i)  In  practice,  BJ  and  B*n  in  (3.20) — (3.21)  cannot  be  computed  since  a,  a* (i) (ii) * * * * * * * x,gj  and  g* 
are  unknown.  However,  BJ  and  B*n  can  be  estimated  by  using  approximations  for  the 
functions  involved  in  these  bounds,  based  on  either  a  priori  information  or  on  the  recent 
forward  and  inverse  problem  solution  updates.  Then,  the  elements  whose  mesh  parameter 
h]m  (h‘n)  exceeds  BJ  ( B*nJ  can  be  determined  and  refined. 

(ii)  After  the  first  sweep  of  refinement,  one  can  compute  the  bound  BJ  and  B™  only  for 

the  new  elements.  We  note  that  for  the  initial  mesh  design,  we  use  a  model  problem 

to  compute  the  terms  in  the  error  bound  relevant  to  the  forward  problem  solution  (see 

appendix).  If  there  is  no  a  priori  information,  ak  can  be  assumed  to  be  spatially  constant 

at  the  first  linearization  step.  After  the  first  linearization,  the  norms  in  BJ  and  B*n 

relevant  to  gj  and  g*  are  not  expected  to  change  significantly.  In  this  context,  the  terms 

II II o,mi ,  II £./ a* ||oy  in  (3.20)  and  (3.21)  can  be  bounded  by  ||£*||0,my Halloo, mi  and 
|| gy || oy  II®'*' II oo,n‘,  respectively.  Therefore,  one  can  store  the  norms  Hgjlloy  and  ||g*||o,mj 

at  the  end  of  the  first  mesh  generation,  and  update  BJ  and  B*n  in  the  following  mesh 

generations  by  using  these  stored  values  and  the  updated  ak  values. 
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(iii)  In  case  e f  cannot  be  chosen  in  prior,  we  consider  a  posterior  approach,  set  Cf  —  1 ,  and 
compute  h,n  /  BJ  (h‘n  /  B*'1}  on  each  element,  which  is  used  as  the  indicator  for  refinement. 

Then,  the  elements  with  indicator  value  exceeding  the  average  hJm  /  BJ  (h‘n/ /?*”)  quantity 
are  marked  for  refinement.  We  note  that  in  this  case,  the  algorithm  has  to  be  stopped 
when  the  number  of  nodes  in  the  mesh  exceeds  the  allowable  number  of  nodes. 


3.2.  Computational  cost  of  the  adaptive  mesh  generation  algorithm  for  the  forward  problem 

Consider  the  algorithm  described  in  remark  l(iii)  for  £2  C  R2.  Using  triangular  finite  elements 
with  first-order  Lagrange  basis  functions  and  an  analytical  (exact)  integration  on  each  finite 
element,  the  number  of  multiplications  required  to  compute  the  L 2  or  //  norm  of  a  finite¬ 
dimensional  function  on  each  triangular  element  Q.]m  (C'J  is  12.  On  the  other  hand,  computing 
the  norm  ||gjcC||o,„i  (ll#*“*llo,mj)  takes  ten  times  the  number  of  multiplications  to  compute 
II#./ Ill, (II sf  II i,n* )■  As  a  result,  the  total  number  of  multiplications  required  to  compute 
the  error  estimates  on  all  finite  elements  for  the  jth  source  is  given  by  ( 1 32/V,/  +  16)  NA. 
Similarly,  the  total  number  of  multiplications  required  to  compute  the  error  estimates  on  all 
finite  elements  for  the  ith  detector  is  equal  to  ( 1 32/Vv  +  16 )NA. 

In  order  to  reduce  the  computational  cost  of  the  proposed  adaptive  mesh  generation 
algorithm,  we  can  approximate  the  bounds  BJ  in  (3.20)  and  B*n  in  (3.21)  as  follows: 


B": 


e,=i  #>iom; + ii  «ii  *  i  i  #ri0mj)ii#iiii 


BT 


'E%igj<x*‘\\0ni  +  Halloo |  Ef=i#/||0n<) lift* lli.»' 


(3.24) 

(3.25) 


Then,  the  number  of  multiplications  required  to  compute  the  error  estimates  on  all  finite 
elements  becomes  148 NJA  (148VJ'),  which  implies  a  significant  reduction  as  compared  to 
(132 Nd  +  16 )Nja  ((132 .N,  +  16 )N‘*). 

If  one  uses  the  discretization  error  estimates  (2. 16) — (2. 17)  to  generate  adaptive  meshes 
for  the  discretization  of  (2.1)-(2.2)  and  (2.3)-(2.4),  the  number  of  multiplications  is  equal 
to  13 Na  and  13 NA,  respectively.  Then,  the  resulting  adaptive  meshes  will  lead  to  finite 
element  solutions  Gj  and  G*  with  reduced  discretization  error.  However,  reduction  in  the 
discretization  error  in  Gj  and  G*  may  not  ensure  the  accuracy  of  the  reconstructed  absorption 
image  (see  simulation  experiment  3). 


3.3.  Adaptive  mesh  generation  for  the  inverse  problem: 

Let  the  mesh  parameter  hm  for  the  solution  of  the  inverse  problem  be  defined  as  follows: 

\AE  l|T—  X,||y_s.xJ|aA1ll,m  +  -\\rn\\Y^Xn 

A 

Nd,Ns  \ 

x  max  ||#*#j||l1(S2)  Yh  ll#,*#yllo,ml|aA||i,m  j  :=  C-  (3-26) 

hj  / 

Then,  by  theorem  1 ,  the  error  in  the  reconstructed  image  due  to  inverse  problem  discretization 
is  bounded  by 


CNAeim  =  em\ 


(3.27) 


1144 


M  Guven  et  al 


where  C  is  a  positive  constant  and  emv  is  the  total  allowable  error  in  the  reconstructed  optical 
image  due  to  inverse  problem  discretization. 

We  present  the  pseudocode  for  our  adaptive  mesh  generation  algorithm  used  at  each 
linearization  of  the  inverse  problem  in  algorithm  2.  Similar  to  the  forward  problem 
discretization,  we  use  Rivara’s  algorithm  [15]  for  the  refinement  of  the  elements. 


Algorithm  2.  The  pseudocode  for  the  mesh  generation  algorithm  at  every  linearization  step 
of  the  inverse  problem. 

©  Generate  an  initial  uniform  mesh  (A ,iVA),  A  =  UmU^m} 

©  Set  (  m 

©  Initialize  the  set  of  marked  elements:  Me  <-  {} 
o  flag  =  True 
while  flag  =  True 

for  each  element  A,„  e  A  with  mesh  parameter  hm 
if  first  linearization 

.  Use  current  solution  updates  Gj  and  G*  and  a  priori  information 
about  a  to  compute  /i™v  in  (3.26) 
else 

.  Use  current  solution  updates  Gj  and  G*  and  af 
to  compute  li["K  in  (3.26) 

end 

if  hm  >  5"'v 
■  Me  <~Me  U[A,„] 

end 

end 

if  Me  ±  {} 

.  Refine  the  marked  elementsand  update  themesh  A 
■  Me  <-  {} 
else 

.  flag  =  False 

end 

end 

©  Solve  for  a'n. 


Remark  2. 

(i)  In  practice,  B in  (3.26)  cannot  be  computed  since  ax,gj,g*  and  Tn  are  unknown. 
Similar  to  the  approach  described  in  section  3.1,  we  can  compute  an  estimate  for  /i'"v  by 
using  the  unifonn  boundedness  of  the  operator  Tn  [9]  and  by  using  approximate  values 
for  the  functions  involved  in  /f '"v .  In  this  context,  we  use  either  a  priori  information  or  the 
recent  forward  and  inverse  problem  solution  updates  to  calculate  (3.26)  on  each  element. 
Then,  the  elements  with  the  mesh  parameter  hm  >  B-"K  are  determined  and  refined. 

(ii)  In  order  to  save  computations,  after  the  first  sweep  of  refinement,  one  can  compute  the 
bound  B'"w  only  for  the  new  elements.  Furthermore,  similar  to  the  approach  described 
in  section  3.1,  the  term  \\g*gj  \\o,m  in  (3.26)  can  be  stored  after  the  first  mesh  generation 
and  can  be  used  in  the  following  mesh  generations.  In  this  context,  the  bound  B"f  can  be 
updated  by  using  only  the  updated  |o'Xj  l  m  value. 
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(iii)  Note  that,  in  practice,  one  of  the  two  terms  in  the  denominator  of  Iff.  will  be  dominant 
depending  on  the  value  of  X.  Thus,  we  consider  only  the  dominant  term  for  the 
computation  of  Iff .  In  case  emv  cannot  be  chosen  in  prior,  we  consider  a  posterior 
approach,  set  elw  =  1  and  compute  \\ocx\\\,mhm  or  ^v'',,v'  \\gfgj\\o,m\\otx\\i,mhm  on  each 
element,  which  are  used  as  the  refinement  indicators.  Then,  the  elements  with  indicator 
value  which  exceeds  the  average  indicator  value  are  refined.  In  this  case,  the  algorithm 
has  to  be  stopped  when  the  number  of  nodes  in  the  mesh  exceeds  the  allowable  number 
of  nodes. 


3.4.  Computational  cost  of  the  adaptive  mesh  generation  algorithm  for  the  inverse  problem 

Consider  the  algorithm  stated  in  remark  2(iii)  for  C  R2  and  assume  that  the  second  term  in 
the  denominator  of  Bfv  (3.26)  is  dominant.  Using  triangular  finite  elements  with  first-order 
Lagrange  basis  functions  and  an  analytical  (exact)  integration  on  each  finite  element,  the  total 
number  of  multiplications  required  to  compute  the  error  estimates  on  all  finite  elements  is 
given  by  (120 NdNs  +  14)1Va. 

In  order  to  reduce  the  number  of  multiplications,  we  can  consider  an  approximation  for 
B"f.  as  follows: 


B" 


■'Nj.Ns 


Si  gj  | 


(3.28) 


Then,  the  number  of  multiplications  reduces  to  1 34 /VA. 

If  one  uses  the  interpolation  error  estimate  (2. 14)  to  generate  adaptive  meshes,  the  number 
of  multiplications  to  compute  the  error  estimates  on  all  finite  elements  will  be  1 3  /VA .  However, 
such  adaptive  meshes  may  not  help  reduce  the  error  in  the  reconstructed  optical  images, 
resulting  from  discretization  (see  simulation  experiment  3). 


4.  Numerical  experiments 

We  conduct  a  series  of  numerical  experiments  to  demonstrate  the  implications  of  theorems  1 
and  2,  and  to  present  the  effectiveness  of  the  proposed  adaptive  mesh  generation  algorithms. 
We  perfonn  our  experiments  in  2D  for  ease  of  comparison. 

In  the  first  simulation,  we  consider  a  series  of  image  reconstructions  to  show  the 
effectiveness  of  the  proposed  adaptive  mesh  generation  algorithms.  In  this  context,  we 
compare  the  images  reconstructed  by  using  uniform  meshes  for  the  forward  and  inverse 
problems  to  the  images  reconstructed  by  using  adaptive  meshes  which  are  designed  based  on 
theorems  1  and  2. 

In  the  second  simulation,  we  show  the  effect  of  the  heterogeneity  size  on  the  accuracy  of 
the  reconstructed  absorption  images.  Next,  we  demonstrate  how  this  error  can  be  addressed 
by  the  proposed  adaptive  discretization  schemes. 

In  the  final  simulation  study,  we  demonstrate  the  implication  of  theorem  2  and  show 
that  meshes  generated  for  the  forward  problem  by  using  discretization  error  estimates  which 
disregard  the  interaction  between  the  solutions  gj ,  g*  and  a'  can  lead  to  unstable  image 
reconstructions.  We  note  that  the  proposed  adaptive  mesh  generation  algorithm  for  the  forward 
problem  addresses  this  problem. 

Note  that  in  all  experiments,  we  use  triangular  finite  elements  with  first-order  Lagrange 
basis  functions.  We  apply  Gaussian  elimination  method  to  solve  the  discrete  forward  problem 
resulting  from  the  variational  formulation  [5]  of  the  boundary  value  problems  (2.1)-(2.2) 
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(a)  (b)  (c) 

Figure  1.  The  setups  used  for  the  simulation  studies  1,  2  and  3.  The  squares  and  triangles  denote 
the  detectors  and  sources,  respectively,  (a)  The  optical  domain  and  source-detector  configuration 
for  simulation  study  1 .  (b)  The  optical  domain  and  source-detector  configuration  for  simulation 
study  2.  n  =  0.50  cm,  7*2  =  0.75  cm,  7*3  =  1.0  cm  and  7*4  =  1.25  cm.  (c)  The  optical  domain  and 
source-detector  configuration  for  simulation  study  3.  The  radius  of  the  circles  is  0.75  cm. 


and  (2.3)-(2.4)  [9].  For  the  inverse  problem,  we  consider  the  discrete  problem  obtained  by 
projecting  (2.18)  by  the  collocation  method  [9]: 

Vntaxn=V„Y,  (4.29) 

where  the  regularization  parameter  is  chosen  as  small  as  possible,  yet  large  enough  to  enable 
robust  image  reconstructions.  In  this  respect,  an  appropriate  value  for  the  regularization 
parameter  is  chosen  based  on  experience.  The  discrete  inverse  problem  (4.29)  is  solved  using 
Gaussian  elimination  as  well. 

4.1.  Simulation  study  1 

In  this  simulation  study,  we  consider  the  geometry  shown  in  figure  1(a).  We  simulate  the 
optical  data  by  solving  the  diffusion  equation  at  co  =  0  on  a  fine  uniform  grid  with  8 1  nodes 
along  the  x  and  y  directions,  where  the  refractive  index  mismatch  parameter  a  =  3.11  sources 
and  1 1  detectors  are  evenly  spaced  on  the  bottom  and  top  edges  of  the  square,  respectively. 
The  diffusion  coefficient  D(x)  =  0.0410  for  x  e  Q  U  i)Q.  The  circular  heterogeneity  with 
absorption  coefficient  /ia  =  0.2  cm-1  is  embedded  in  an  optically  homogeneous  background 
with  yu,fl  =  0.04  cm-1. 

In  order  to  obtain  a  series  of  absorption  imaging  problems  using  the  same  setup,  we 
consider  five  values  for  the  background  absorption  value.  Then,  for  each  imaging  problem, 
we  consider  three  mesh  scenarios:  uniform  mesh  for  both  forward  and  inverse  problems; 
adaptive  mesh  for  the  forward  problem  and  uniform  mesh  for  the  inverse  problem;  and 
adaptive  meshes  for  both  forward  and  inverse  problems.  We  refer  to  table  2  for  a  brief  outline 
of  the  first  simulation  study. 

The  uniform  mesh  used  for  the  forward  problem  discretization  has  625  nodes  and  is 
shown  in  figure  2(a).  The  uniform  mesh  for  the  inverse  problem  has  313  nodes  and  is  shown  in 
figure  2(b).  We  use  the  algorithms  described  in  section  3.1  and  remark  l(iii),  and  section  3.3 
remark  2(iii)  to  generate  the  adaptive  meshes  for  the  forward  and  inverse  problems, 
respectively.  The  number  of  nodes  in  each  of  the  adaptive  meshes  used  for  the  forward  problem 
does  not  exceed  750.  An  example  for  the  adaptive  mesh  generated  for  a  source  located  at 
(1.0,  0)  is  shown  in  figure  2(c).  The  adaptive  mesh  for  the  inverse  problem  generated  for  the 
case  where  the  background  /i„  =  0.050  cm-1  has  418  nodes  and  is  shown  in  figure  2(d). 
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Figure  2.  Examples  of  meshes  used  in  the  first  simulation  study,  (a)  The  uniform  mesh  with  625 
nodes,  (b)  The  uniform  mesh  with  313  nodes,  (c)  The  adaptive  mesh  generated  for  the  forward 
problem  for  the  source  located  at  (1.0,0):  background  /ia  =  0.050  cm-1,  (d)  The  adaptive  mesh 
generated  for  the  inverse  problem  solution,  with  418  nodes.  Background  na  =  0.050  cm-1. 


Table  2.  The  mesh  scenarios  and  the  background  /ia  values  in  simulation  study  1 . 


Mesh  (forward) 

Mesh  (inverse) 

Background  \xa  (cm  1 ) 

Uniform 

Adaptive 

Adaptive 

Uniform 

Uniform 

Adaptive 

0.032,  0.036,  0.040,  0.044,  0.050 
0.032,  0.036,  0.040,  0.044,  0.050 
0.032,  0.036,  0.040,  0.044,  0.050 

For  the  inverse  problem,  we  set  the  regularization  parameter  X  to  10”  7  in  all  experiments  to 
eliminate  the  dependence  of  the  error  estimates  (2. 15)— (2. 19)  on  the  regularization  parameter. 
We  consider  the  image  reconstructed  by  using  fine  uniform  meshes  (61  x  61  nodes  for  the 
forward  problem  and  61x61  nodes  for  the  inverse  problem)  as  the  reference  image  ax,  which 
is  assumed  to  possess  no  error  due  to  discretization.  We  compute  the  error  |aA  —  ax  ||  /  un. 
for  each  image  reconstruction  and  tabulate  the  results  in  table  3.  We  see  that  the  error  in 
the  images  reconstructed  by  using  uniform  meshes  for  both  forward  and  inverse  problems  is 
significantly  reduced  by  the  use  of  adaptively  refined  meshes.  A  similar  behaviour  is  observed 
for  all  choices  of  background  absorption  value. 
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Figure  3.  The  reconstruction  results  of  simulation  study  1,  with  the  background  fia  =  0.032  cm-1 . 
(a)  The  optical  absorption  image  used  as  the  reference  for  error  computations,  (b)  The  reconstructed 
absorption  image  using  the  uniform  mesh  in  figure  2(a)  for  the  forward,  and  the  uniform  mesh  in 
figure  2(b)  for  the  inverse  problem,  (c)  The  reconstructed  absorption  image  using  an  adaptive  mesh 
for  the  forward,  and  the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (d)  The  reconstructed 
absorption  image  using  an  adaptive  mesh  for  the  forward,  and  the  adaptive  mesh  in  figure  2(d)  for 
the  inverse  problem. 


Table  3.  The  error  ||o4  —  a£|lz,i(n)  for  each  experiment  described  in  the  simulation  study  1  and 
table  2.  The  first  column  shows  the  type  of  the  meshes  used  in  the  forward  and  inverse  problems, 
respectively.  The  unit  of  background  /ia  is  cm-1. 


Background  na : 

0.032 

0.036 

0.040 

0.044 

0.050 

Uniform-uniform 

ii“*- 

"  “n  Hi1  (£2)  : 

0.2325 

0.2559 

0.2773 

0.2932 

0.3013 

Adaptive-uniform 

11“*  - 

"  “n  HlOSJ)  : 

0.1238 

0.1139 

0.1166 

0.1209 

0.1278 

Adaptive-adaptive 

ii“*- 

"  “n  II L1  (£2)  : 

0.1043 

0.0997 

0.0998 

0.1003 

0.1009 

We  present  image  reconstructions  in  figures  3  and  4  for  the  two  extreme  cases,  where 
the  background  absorption  value  is  equal  to  0.032  and  0.050  cm-1,  respectively.  Figures  3(a) 
and  4(a)  display  the  reference  images  used  to  compute  the  error  values  given  in  table  3. 
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(c)  (d) 

Figure  4.  The  results  of  simulation  study  1,  with  the  background  /ia  =  0.050  cm-1,  (a)  The 
optical  absorption  image  used  as  the  reference  for  error  computations,  (b)  The  reconstructed 
absorption  image  using  the  uniform  mesh  in  figure  2(a)  for  the  forward,  and  the  uniform  mesh  in 
figure  2(b)  for  the  inverse  problem,  (c)  The  reconstructed  absorption  image  using  an  adaptive  mesh 
for  the  forward,  and  the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (d)  The  reconstructed 
absorption  image  using  an  adaptive  mesh  for  the  forward,  and  the  adaptive  mesh  in  figure  2(d)  for 
the  inverse  problem. 


Figures  3(c)  and  (d)  show  that  the  optical  heterogeneity  is  resolved  better  by  using  adaptive 
meshes  as  compared  to  the  reconstructed  image  obtained  by  using  uniform  meshes,  which  is 
shown  in  figure  3(b).  These  results  are  consistent  with  the  error  values  given  in  table  3.  A 
similar  trend  is  seen  in  figures  4(c)  and  (d).  Note  that  the  number  of  nodes  in  the  adaptive 
meshes  is  almost  equal  to  the  number  of  nodes  that  the  uniform  meshes  have.  In  figure  5, 
we  show  the  cross-sectional  views  from  the  reconstructed  images.  We  see  that  the  use  of 
coarse  uniform  meshes  fails  to  resolve  the  circular  heterogeneity  especially  for  the  case  in 
which  the  background  /ia  =  0.032  cm  1 . 

4.2.  Simulation  study  2 

In  this  study,  we  consider  the  geometry  shown  in  figure  1(b).  To  simulate  the  optical  data, 
we  use  the  same  source-detector  configuration  considered  in  the  first  simulation  study.  We 
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Figure  5.  The  cross-sectional  views  from  the  reconstructed  images  in  simulation  study  1, 
corresponding  to  the  cases  where  the  background  jia  =  0.032  and  /r„  =  0.050  cm~ 1 ,  respectively. 

(a)  The  cross-sectional  cuts  taken  from  figures  3(a),  (b)  and  (d),  along  the  .v-direction  at  y  =  3. 
The  solid,  square  and  diamond  lines  correspond  to  the  cross-sectional  cuts  taken  from  the  images 
shown  in  figures  3(a),  (b)  and  (d),  respectively,  (b)  The  cross-sectional  cuts  taken  from  figures  4(a), 

(b)  and  (d),  along  the  x-direction  at  y  =  3.  The  solid,  square  and  diamond  lines  correspond  to  the 
cross-sectional  cuts  taken  from  the  images  shown  in  figures  4(a),  (b)  and  (d),  respectively. 


simulate  the  optical  data  by  solving  the  diffusion  equation  at  co  =  0  on  a  fine  uniform  grid 
with  8 1  nodes  along  the  x  and  y  directions,  where  the  refractive  index  mismatch  parameter 
a  =  3.  The  diffusion  coefficient  D  is  assumed  to  be  constant  and  D(x)  =  0.0410  cm,  for  all 
x  e  Q,  U 

We  consider  four  different  radii  for  the  circular  heterogeneity  with  //,,  =  0.20  cm-1 
embedded  in  a  background  with  /ia  =  0.040  cm  1  as  shown  in  figure  1(b).  For  each  case,  we 
compute  the  error  for  different  mesh  scenarios,  similar  to  the  first  simulation  study:  uniform 
mesh  for  both  forward  and  inverse  problems;  adaptive  mesh  for  the  forward  problem  and 
uniform  mesh  for  the  inverse  problem;  and  adaptive  meshes  for  both  forward  and  inverse 
problems.  The  adaptive  meshes  for  this  simulation  study  were  generated  based  on  theorems  1 
and  2,  and  the  mesh  generation  algorithms  described  in  the  first  simulation  study  and  section  3. 
The  uniform  meshes  used  for  the  forward  and  inverse  problems  are  identical  to  those  used  in 
the  first  simulation  study.  We  note  that  the  number  of  nodes  in  the  adaptive  meshes  generated 
for  the  forward  and  inverse  problems  is  close  to  the  number  of  nodes  in  the  corresponding 
uniform  meshes. 

In  table  4,  we  tabulate  the  error  norm  ||  a/'  —  a'n  ||  obtained  for  each  heterogeneity  size 
with  different  mesh  choices,  where  a'  is  the  reference  image  reconstructed  by  using  fine 
uniform  meshes  as  in  the  first  study.  Table  4  shows  that  the  error  increases  with  increasing 
heterogeneity  size.  We  see  that  the  reduction  in  the  error  as  a  result  of  using  adaptive  meshes 
is  more  significant  for  smaller  sized  heterogeneities.  Further  reduction  in  the  error  norm 
|  ax  —  a'n  |  is  possible  by  increasing  the  number  of  nodes  in  the  meshes. 

For  brevity,  we  only  show  the  reconstruction  results  for  the  extreme  cases:  r  —  0.5  cm  and 
r  =  1.25  cm.  We  note  that  the  regularization  parameter  X  =  5  x  10~9  in  all  reconstructions. 
Figures  6(a)  and  (b)  show  the  images  used  as  the  reference  images  a'-  in  the  calculation  of 
the  error  norms  |a';‘  —  a'n  ||  listed  in  table  4.  Figures  6(e)-(f)  show  that  the  adaptive  meshes 
reduce  the  artefacts  as  compared  to  the  images  reconstructed  by  using  uniform  meshes,  which 
are  shown  in  figures  6(c)-(d). 
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Table  4.  The  L1  norm  of  a'  and  the  error  ||a!A  —  aA  ||  r  i  (Q)  for  each  experiment  described  in  the 
simulation  study  2.  The  first  column  shows  the  type  of  the  meshes  used  in  the  forward  and  inverse 
problems,  respectively.  The  radius  of  the  circular  heterogeneity  is  given  in  cm. 


Radius: 

0.50 

0.75 

1.0 

1.25 

0.7196 

1.3760 

1.4759 

1.7817 

U  niform-uniform 

H“X-“nllLl(S2): 

0.5622 

0.5706 

0.5850 

0.6337 

Adaptive-uniform 

0.2153 

0.2776 

0.3766 

0.5113 

Adaptive-adaptive 

II“X-“»IIl1(S2): 

0.2020 

0.2630 

0.3592 

0.5034 

4.3.  Simulation  study  3 

In  this  simulation  study,  we  consider  the  geometry  shown  in  figure  1(c).  The  centre  of 
the  circular  heterogeneity  is  moved  vertically  towards  the  detector  side  to  see  the  effect  on 
the  imaging  accuracy.  Next,  we  show  how  the  error  in  the  reconstructed  images  due  to 
discretization  can  be  addressed  by  using  appropriate  meshes  for  the  solutions  of  the  forward 
and  inverse  problems.  In  this  context,  we  compare  the  results  obtained  by  using  (1)  uniform 
meshes,  (2)  the  adaptive  meshes  generated  using  conventional  a  priori  discretization  error 
estimates,  and  (3)  the  adaptive  meshes  proposed  in  this  study.  By  conventional  error  estimates, 
we  mean  the  a  priori  discretization  error  estimates  (2.16)  and  (2.17)  for  the  solution  of  the 
forward  problem,  and  the  a  priori  interpolation  error  estimate  (2.14)  for  the  solution  of  the 
inverse  problem. 

To  simulate  the  optical  data,  we  use  the  same  source-detector  configuration  considered 
in  the  first  simulation  study.  We  simulate  the  optical  data  by  solving  the  diffusion  equation  at 
Co  =  0  on  a  fine  uniform  grid  with  8 1  nodes  along  the  x  and  y  directions,  where  the  refractive 
index  mismatch  parameter  a  =  3.  We  note  that,  in  all  reconstructions,  the  background 
absorption  value  is  set  to  /ia  =  0.04  cm-1  and  the  diffusion  coefficient  D  is  assumed  to  be 
constant  and  D(x)  =  0.0410  cm,  for  all  x  e  £2  U  3f2. 

The  uniform  meshes  used  in  this  simulation  study  are  identical  to  those  used  in  simulation 
studies  1  and  2.  Sample  meshes  for  the  forward  problem  solution  using  the  conventional  and 
the  proposed  adaptive  meshing  strategies  are  shown  in  figures  7(a)  and  (b)  and  figures  8(c) 
and  (d),  respectively.  We  see  that  the  conventional  adaptive  mesh  generation  strategy  leads  to 
meshes  refined  around  only  sources  or  detectors,  but  not  both.  In  contrast,  figures  8(c)  and  (d) 
show  that  the  proposed  strategy  results  in  adaptive  meshes  refined  around  sources,  detectors 
and  the  heterogeneity  as  well.  This  observation  is  consistent  with  theorem  2.  The  adaptive 
mesh  for  the  inverse  problem  solution,  which  was  generated  using  the  a  priori  interpolation 
error  estimate  (2.14),  is  shown  in  figure  8(f).  Note  that  the  mesh  was  generated  for  the 
case  where  the  circular  heterogeneity  was  centred  at  (3.0,  3.5).  The  mesh  generated  based 
on  theorem  1  (figure  8(e))  provides  higher  resolution  close  to  the  sources  and  detectors  as 
compared  to  the  mesh  shown  in  figure  8(f),  which  is  merely  refined  around  the  heterogeneity. 

In  this  simulation  study,  we  consider  four  different  positions  for  the  centre  of  the  circular 
heterogeneity  with  radius  0.75  cm,  along  the  y-axis:  centre  at  (3.0,  3.0),  (3.0,  3.5),  (3.0, 
4.0)  and  (3.0,  4.5),  respectively.  Similar  to  the  previous  simulations,  we  compute  the  error 
in  the  reconstructed  images  for  all  cases,  and  compare  the  error  values  attained  by  different 
meshing  strategies.  Finally  we  present  the  reconstructed  images  obtained  by  using  different 
mesh  strategies  corresponding  to  the  case  where  the  circular  inclusion  is  centred  at  (3.0,  3.5) 
and  (3.0,  4.0). 
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Figure  6.  The  results  of  simulation  study  2.  The  left  and  right  columns  show  the  reconstructed 
images  regarding  the  optical  heterogeneity  with  radius  0.50  cm  and  1 .25  cm,  respectively.  The 
background  /ia  =  0.040  cm-1  in  all  of  the  reconstructions.  The  reference  images  shown  in 
(a)  and  (b)  are  obtained  using  a  uniform  mesh  with  61x61  nodes  in  both  the  forward  and  inverse 
problems,  ((a)  and  (b))  The  optical  absorption  images  used  as  the  reference  for  error  computations. 
The  images  correspond  to  the  reconstruction  of  the  circular  heterogeneities  of  radii  0.5  cm  and 
1.25  cm,  respectively,  ((c)  and  (d))  The  reconstructed  absorption  images  using  the  uniform  mesh 
in  figure  2(a)  for  the  forward,  and  the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem. 
((e)  and  (f))  The  reconstructed  absorption  image  using  adaptive  meshes  for  both  the  forward  and 
the  inverse  problems. 
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Figure  7.  ((a)  and  (b))  Samples  of  adaptive  meshes  in  the  third  simulation  study  (with  865  nodes 
for  the  source  and  the  detector  located  at  (2.0, 0)  and  (4.0, 6.0),  respectively),  generated  by  using  the 
conventional  error  estimates  (2.16)  and  (2.17),  which  led  to  unstable  optical  image  reconstruction 
shown  in  (c)  to  (f),  for  the  circular  heterogeneity  centred  at  (3.0,  3.5).  ((c)  and  (d))  The  unstable 
optical  image  reconstructions  in  the  third  simulation  study,  obtained  by  using  the  adaptive  meshes 
for  the  forward  problem  solution  whose  examples  are  shown  in  (a)  and  (b)  (X  =  10~8).  ((e)  and 
(f))  The  unstable  optical  image  reconstructions  in  the  third  simulation  study,  obtained  by  using  the 
adaptive  meshes  for  the  forward  problem  solution  whose  examples  are  shown  in  (a)  and  (b).  X  was 
set  to  10~6  to  suppress  the  significantly  large  artefacts  observed  in  (c)  and  (d). 


Using  the  meshes  for  the  forward  problem  discretization  (see  figures  7(a)  and  (b)), 
which  were  generated  by  using  the  conventional  a  priori  discretization  error  estimates 
(2. 16)— (2. 17),  leads  to  the  image  reconstructions  shown  in  figures  7(c)  and  (d),  where  the 
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Figure  8.  Samples  of  adaptive  meshes  used  in  the  third  simulation  study,  which  led  to  the  optical 
image  reconstructions  shown  in  figure  10.  The  meshes  were  generated  for  the  circular  heterogeneity 
centred  at  (3.0,  4.0).  (a)  The  adaptive  mesh  with  942  nodes  for  the  forward  problem  solution  for 
the  source  located  at  (2.0,  0),  obtained  by  refining  the  adaptive  mesh  shown  in  figure  7(a)  around 
the  detectors,  (b)  The  adaptive  mesh  with  955  nodes  for  the  forward  problem  solution  for  the 
detector  located  at  (4.0,  6.0),  obtained  by  refining  the  adaptive  mesh  shown  in  figure  7(b)  around 
the  sources,  (c)  The  adaptive  mesh  with  895  nodes  for  the  forward  problem  solution  for  the  source 
located  at  (2.0,  0),  generated  based  on  theorem  2.  (d)  The  adaptive  mesh  with  896  nodes  for  the 
forward  problem  solution  for  the  detector  located  at  (4.0,  6.0),  generated  based  on  theorem  2. 
(e)  The  adaptive  mesh  with  691  nodes  for  the  inverse  problem  solution,  generated  based  on 
theorem  1.  (f)  The  adaptive  mesh  with  609  nodes  for  the  inverse  problem  solution,  generated  based 
on  the  conventional  error  estimate  (2.14). 
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regularization  parameter  X  =  10~8.  We  observe  that  the  finite-dimensional  operator  does 
not  provide  a  stable  solution.  We  note  that  using  an  adaptive  mesh  for  the  inverse  problem 
solution  does  not  change  the  outcome  (figure  7(d)).  Note  also  that  the  meshes  generated  by 
using  the  conventional  a  priori  discretization  error  estimates  (2. 16)— (2. 17)  are  sufficient  to 
provide  accurate  finite  element  approximations  to  the  actual  solutions  gj  and  g*.  Therefore, 
the  unstable  reconstructions  can  be  attributed  to  the  errors  K.—K.  and  y  —  y ,  due  to  inappropriate 
discretization  as  noted  by  theorem  2.  In  consistence  with  theorem  2,  this  observation  suggests 
that  solving  the  forward  problem  accurately  does  not  necessarily  imply  that  approximate 
operator  K,  and  y  are  error-free.  Therefore,  in  order  to  address  such  problems,  one  has  to 
follow  a  discretization  scheme  based  on  theorem  2  for  the  solution  of  the  forward  problem, 
which  takes  into  account  the  interaction  between  the  solutions  of  the  diffusion  equation  and 
the  associated  adjoint  problem,  as  described  in  section  3.1. 

In  order  to  suppress  the  severe  artefacts  observed  in  figures  7(c)  and  (d),  we  increased 
the  regularization  parameter  and  set  X  =  10~6.  The  resulting  images  are  shown  in 
figures  7(e)  and  (f).  As  noted  by  theorems  1  and  2,  increasing  the  regularization  parameter 
reduces  the  error  in  the  reconstructed  images.  However,  increasing  the  regularization 
parameter  will  also  compromise  the  image  quality  and  lead  to  over-smoothed  images.  In 
order  to  address  the  instability  issue  without  degrading  the  image  quality  by  using  high 
regularization  parameters,  we  modified  the  adaptive  mesh  generation  method  that  leads  to  the 
meshes  shown  in  figures  7(a)  and  (b).  In  this  context,  for  the  first  2  refinements,  we  used 
the  proposed  mesh  generation  algorithm  based  on  theorem  2  to  generate  an  initial  adaptive 
mesh;  and  for  the  next  two  refinements,  we  used  the  conventional  error  estimates  (2. 16)— 
(2.17).  Following  this  modification,  the  samples  of  the  resulting  adaptive  meshes  are  shown  in 
figures  8(a)  and  (b).  For  a  comparison,  we  also  present  in  figures  8(c)  and  (d),  the  adaptive 
meshes  generated  by  using  the  proposed  adaptive  mesh  generation  algorithms  as  described 
in  section  3.1  and  remark  l(iii).  We  observe  that  the  meshes  shown  in  figures  8(c)  and 
(d),  indicate  further  refinement  around  sources,  detectors  and  the  circular  heterogeneity  as 
compared  to  the  adaptive  meshes  shown  in  figures  8(a)  and  (b). 

Examples  of  the  adaptive  meshes  generated  for  the  inverse  problem  based  on  theorem  1 
and  the  conventional  a  priori  interpolation  estimate  (2.14)  are  shown  in  figures  8(e)  and 
(f),  respectively.  We  observe  that  the  adaptive  mesh  shown  in  figure  8(e)  provides  higher 
resolution  around  sources  and  detectors  as  compared  to  the  adaptive  mesh  shown  in 
figure  8(f). 

We  note  that  the  uniform  meshes  used  in  this  simulation  study  are  identical  to  those  used 
in  the  previous  simulation  studies. 

In  order  to  compare  the  performance  of  the  conventional  and  proposed  adaptive  mesh 
strategies,  we  perform  four  experiments  and  compute  the  error  in  the  reconstructed  optical 
absorption  images.  For  each  experiment,  we  consider  five  different  mesh  strategies  and  refer 
to  table  5  for  the  description  of  these  experiments. 

We  show  the  reconstructed  optical  absorption  images  for  the  two  cases  in  figures  9  and  10, 
corresponding  to  the  circular  heterogeneity  centred  at  (3.0,  3.5)  and  (3.0,  4.0),  respectively. 
Figures  9(a)  and  10(a)  show  the  reference  absorption  image  reconstructions  which  are  used  to 
compute  the  error  in  the  reconstructed  optical  images. 

Figure  9(b)  shows  the  image  reconstructed  using  coarse  uniform  meshes  for  both  the 
forward  and  inverse  problems,  for  the  case  where  the  circular  inclusion  is  centred  at  (3.0, 
3.5)  where  the  regularization  parameter  was  set  to  X  =  10~8.  With  the  same  value  of  the 
regularization  parameter,  figure  9(c)  shows  the  reconstructed  image  by  using  the  adaptive 
mesh  based  on  theorem  2  for  the  forward  problem  and  the  coarse  uniform  mesh  (shown  in 
figure  2(b))  for  the  inverse  problem.  Figure  9(e)  shows  the  reconstructed  image  obtained 
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Table  5.  The  relevant  parameters  in  experiments  1—5  in  simulation  study  3.  The  abbreviation 
‘Conv.'  implies  that  the  corresponding  mesh  was  generated  using  the  conventional  a  priori 
discretization  error  estimates  (2. 16)— (2. 17)  for  the  forward  problem  solution,  and  the  conventional 
a  priori  interpolation  error  estimate  (2.14)  for  the  inverse  problem  solution.  The  abbreviation 
’Prop.’  refers  to  the  adaptive  meshes  generated  by  using  the  proposed  adaptive  mesh  generation 
algorithms  based  on  theorems  1  and  2,  for  the  inverse  and  forward  problem  solutions,  respectively. 
The  last  column  in  the  table  shows  the  coordinates  of  the  centre  of  the  circular  heterogeneity, 
considered  in  each  experiment. 


Mesh  (forward) 

Mesh  (inverse) 

Centre  at: 

Exp.  1 
Exp.  2 
Exp.  3 
Exp.  4 
Exp.  5 

Uniform 

Adaptive  (Conv.) 
Adaptive  (Conv.) 
Adaptive  (Prop.) 
Adaptive  (Prop.) 

Uniform 

Uniform 

Adaptive  (Conv.) 
Uniform 

Adaptive  (Prop.) 

[(3.0,  3.0),  (3.0,  3.5),  (3.0,  4.0),  (3.0,  4.5)] 
[(3.0,  3.0),  (3.0,  3.5),  (3.0,  4.0),  (3.0,  4.5)] 
1(3.0,  3.0),  (3.0,  3.5),  (3.0,  4.0),  (3.0,  4.5)] 
1(3.0,  3.0),  (3.0,  3.5),  (3.0,  4.0),  (3.0,  4.5)] 
1(3.0,  3.0),  (3.0,  3.5),  (3.0,  4.0),  (3.0,  4.5)] 

Table  6.  The  error  ||ax  —  || /  i  ^  for  each  experiment  described  in  the  simulation  study  3.  The 

first  column  shows  the  type  of  the  meshes  used  in  the  forward  and  inverse  problems,  respectively. 
The  superscript  ‘C’  denotes  that  the  corresponding  adaptive  mesh  generation  is  based  on  the 
conventional  a  priori  error  estimates  (2. 16) — (2. 17)  and  (2.14). 


Radius  at: 

(3.0,  3.0) 

(3.0,  3.5) 

(3.0,  4.0) 

(3.0,  4.5) 

Uniform-uniform 

ii 

—  “nlli1(S2): 

0.4539 

0.4606 

0.4733 

0.4956 

Adaptive-uniform 

ii 

~  “n HlMsU 

0.2690 

0.2695 

0.2634 

0.2507 

Adaptive-adaptive 

II 

~  “n  IIl1(S2): 

0.2433 

0.2455 

0.2459 

0.2434 

Adaptive-uniform 

ll«* 

-  “n^lli'cnF 

0.7989 

0.7596 

0.7072 

0.6418 

Adaptive-adaptive 

II  a* 

-  “h’C|Iz,1(S2): 

0.8011 

0.7614 

0.7070 

0.6351 

by  using  the  adaptive  meshes  based  on  theorems  1  and  2.  We  observe  the  improvements 
especially  around  the  boundaries.  Using  the  conventional  adaptive  meshes  for  the  forward 
problem  solution,  which  were  modified  around  sources  and  detectors  as  noted  before,  we 
ran  into  a  similar  instability  problem.  Therefore,  in  order  to  obtain  better  reconstructions 
with  the  conventional  adaptive  meshes,  we  set  the  regularization  parameter  X  =  1 0  7  in  the 
corresponding  inverse  problem  formulations.  The  resulting  reconstructed  images  are  shown 
in  figures  9(c)  and  (f).  In  this  case,  we  observe  that  the  use  of  conventional  adaptive  meshes 
for  the  forward  and  inverse  problems  does  not  improve  the  image  quality  as  compared  to  the 
reconstructed  image  shown  in  figure  9(b),  which  is  obtained  by  using  coarse  uniform  meshes. 

We  observe  similar  results  for  the  case  where  the  circular  inclusion  is  centred  at  (3.0,  4.0). 
We  note  that  the  regularization  parameter  is  set  to  X  =  1 0  x  for  all  reconstructions  except  for 
the  reconstructions  obtained  by  using  conventional  adaptive  meshes,  in  which  case  X  =  10  7. 
Figure  9  shows  the  reconstructed  images  corresponding  to  all  meshing  strategies. 

Table  6  shows  the  error  norm  a 1  —  6t„  computations  for  all  cases.  Similar  to  the 
previous  experiments,  a 1  is  the  reference  image  obtained  by  using  fine  uniform  meshes 
for  the  discretization  of  the  forward  and  inverse  problems.  The  error  values  are  consistent 
with  figures  9  and  10.  In  all  cases,  the  proposed  adaptive  meshes  significantly  reduce  the 
error  in  the  reconstructed  images.  Furthermore,  the  image  quality  is  enhanced  by  merely 
appropriate  discretization,  without  having  to  increase  the  regularization  parameter.  In  contrast, 
the  conventional  adaptive  meshes  perform  worse  than  uniform  meshes  even  though  a  higher 
regularization  parameter  is  used. 
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Figure  9.  The  reconstructed  optical  images  regarding  the  circular  heterogeneity  centred  at 
(3.0,  3.5)  in  the  third  simulation  study,  (a)  The  absorption  image  used  as  the  reference  in  the 
error  computations,  (b)  The  reconstructed  absorption  image  using  the  uniform  mesh  in  figure  2(a) 
for  the  forward,  and  the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (c)  The  reconstructed 
absorption  image  using  adaptive  meshes  based  on  theorem  2  for  the  forward,  and  the  uniform 
mesh  in  figure  2(b)  for  the  inverse  problem,  (d)  The  reconstructed  absorption  image  using  adaptive 
meshes  based  on  a  priori  error  estimates  (2.16)  and  (2.17)  for  the  forward,  and  the  uniform  mesh 
in  figure  2(b)  for  the  inverse  problem,  (e)  The  reconstructed  absorption  image  using  adaptive 
meshes  based  on  theorem  2  for  the  forward,  and  using  the  adaptive  mesh  based  on  theorem  1 
for  the  inverse  problem,  (f)  The  reconstructed  absorption  image  using  adaptive  meshes  based  on 
a  priori  error  estimates  (2. 16)  and  (2. 17)  for  the  forward,  and  the  interpolation  error  estimate  (2. 14) 
for  the  inverse  problem. 
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Figure  10.  The  reconstructed  optical  images  regarding  the  circular  heterogeneity  centred  at 
(3.0,  4.0)  in  the  third  simulation  study,  (a)  The  absorption  image  used  as  the  reference  in  the 
error  computations,  (b)  The  reconstructed  absorption  image  using  the  uniform  mesh  in  figure 
2(a)  for  the  forward,  and  the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (c)  The 
reconstructed  absorption  image  using  adaptive  meshes  based  on  theorem  2  for  the  forward,  and 
the  uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (d)  The  reconstructed  absorption  image 
using  adaptive  meshes  based  on  a  priori  error  estimates  (2.16)  and  (2.17)  for  the  forward,  and  the 
uniform  mesh  in  figure  2(b)  for  the  inverse  problem,  (e)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  theorem  2  for  the  forward,  and  the  adaptive  mesh  based  on  theorem  1 
for  the  inverse  problem,  (f)  The  reconstructed  absorption  image  using  adaptive  meshes  based  on 
a  priori  error  estimates  (2. 16)  and  (2.17)  for  the  forward,  and  the  interpolation  error  estimate  (2. 14) 
for  the  inverse  problem. 


Effect  of  discretization  error  and  adaptive  mesh  generation  in  diffuse  optical  tomography:  II 


1159 


5.  Conclusion 

In  this  work,  based  on  the  error  analysis  presented  in  part  I  [9],  we  developed  two  new  adaptive 
mesh  generation  algorithms,  one  for  the  forward  and  one  for  the  inverse  problem,  which  take 
into  account  the  interdependence  between  the  solutions  of  the  two  problems.  We  have  also 
presented  the  computational  complexity  of  the  presented  adaptive  mesh  generation  algorithms. 
Our  numerical  experiments  provided  a  verification  of  theorems  1  and  2  and  showed  that  the 
proposed  mesh  generation  algorithms  significantly  improve  the  accuracy  of  the  reconstructed 
optical  images  for  a  given  number  of  unknowns  in  the  discrete  forward  and  inverse  problems. 
Conventional  error  estimates  do  not  include  domain-specific  factors.  As  a  result,  the  adaptive 
mesh  generation  algorithms  based  on  conventional  error  estimates  (2. 16) — (2. 17)  and  (2.14) 
may  lead  to  high  errors  in  reconstructed  optical  images  as  demonstrated  in  our  numerical 
experiments. 

We  finally  note  that  the  adaptive  mesh  generation  algorithms  introduced  in  this  paper  can 
be  adapted  for  the  forward  and  inverse  problems  of  similar  inverse  parameter  estimation 
problems,  such  as  electrical  impedance  tomography,  optical  fluorescence  tomography, 
bioluminescence  tomography,  microwave  imaging  etc. 
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Appendix.  Solution  of  the  model  problem 


In  order  to  initialize  the  adaptive  mesh  for  the  solution  of  the  forward  problem  (provided 
Z)(x)  =  D  and  p,a(x)  =  Jp,  are  spatially  constant),  we  use  an  analytical  solution  to  compute 
the  estimates  of  gj  and  g*.  Below,  we  give  the  solution  in  2D  for  (2.1).  Under  the  same 
conditions,  an  analytical  solution  for  the  adjoint  problem  (2.3)  can  be  obtained  in  a  similar 
way. 

First,  we  use  the  polar  coordinates  (p,  0)  to  rewrite  (2.1): 


1  3 
P  dp 


i  1  9  fJ8j\  i  ri 2 * 4r  4n  S(p  -  pJs)8(6  -6j) 

p  39  \  d0  )  a8j  p  D 


where  we  consider  an  unbounded  domain,  model  the  point  source  located  at  (pi,  0i)  by  the 
Dirac-delta  function  47r 3 (p —  pi) <5(0 —  0i)/p  and  K q  —  —  (pjjc-t-i&O/cZ).  Then,  the  solution 
gj  at  (p,  0)  due  to  the  point  source  located  at  (pi ,  0i)  is  given  by  [11] 


4  1 

8j{p,  Pi',d,  0/)  =  =-  |  -/ofep<)X0fep>)  +  2_)cos[m(0 
Dir  l  Z  m= 1 


0.v )  I  Cz  ( k<>  P  ■  )  (ko  p> ) 


where  p<  means  the  smaller  of  p  and  pi ,  p>  means  the  greater  of  p  and  pi ,  and  Km  are 
the  modified  Bessel  functions  of  the  first  and  second  kind,  respectively  [1]  and  kn  = 

The  solution  of  the  problem  in  3D  can  be  derived  in  a  similar  manner  [11,  17]. 
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Abstract — In  Diffuse  Optical  Tomography  (DOT),  the  dis¬ 
cretization  error  in  the  numerical  solutions  of  the  forward 
and  inverse  problems  results  in  error  in  the  reconstructed 
optical  images.  In  this  work,  based  on  the  analysis  presented 
in  [5],  we  present  two  theorems  that  constitute  the  basis  for 
adaptive  mesh  generation  for  the  forward  and  inverse  DOT 
problems.  The  proposed  discretization  schemes  lead  to  adaptively 
refined  composite  meshes  that  yield  the  desired  level  of  imaging 
accuracy  while  reducing  the  size  of  the  discretized  forward  and 
inverse  problems.  Our  numerical  experiments  validate  the  error 
estimates  developed  in  [5]  and  show  that  the  new  adaptive  mesh 
generation  algorithms  improve  the  accuracy  of  the  reconstructed 
optical  images. 

I.  Introduction 

Imaging  in  Diffuse  Optical  Tomography  (DOT)  is  composed 
of  two  interdependent  stages  which  seek  solutions  to  the 
forward  and  inverse  problems.  The  forward  problem  deals 
with  describing  the  Near  Infrared  (NIR)  light  propagation  and 
the  inverse  problem  is  concerned  with  the  estimation  of  the 
unknown  optical  parameters  from  boundary  measurements  [1], 

Numerical  approaches  in  solving  the  forward  and  inverse 
problems  in  Diffuse  Optical  Tomography  (DOT)  poses  a  trade¬ 
off  between  computational  efficiency  and  imaging  accuracy. 
This  tradeoff  is  a  direct  consequence  of  the  discretization  of 
the  forward  and  inverse  problems  [5],  [1]  and  the  size  of  the 
resulting  discrete  forward  and  inverse  problems.  Attempting 
to  minimize  the  discretization  error  in  the  solutions  of  both 
problems  separately  results  in  increased  size  of  the  discrete 
forward  and  inverse  problems.  Thus,  it  is  important  to  under¬ 
stand  the  relationship  between  the  discretization  error  and  the 
resulting  error  in  the  solution  of  the  inverse  problem. 

In  [5],  we  presented  an  error  analysis  that  shows  the  effect 
of  discretization  of  the  forward  and  inverse  problems  on  the 
accuracy  of  DOT  imaging.  In  this  work,  based  on  the  two 
error  bounds  provided  by  the  error  analysis  in  [5],  we  present 
an  adaptive  discretization  scheme  for  the  forward  and  inverse 
problems,  respectively.  We  remark  that  the  mesh  refinement 
criterion  for  each  problem  comprises  the  discretization  error 
in  the  corresponding  problem  solution,  scaled  spatially  by  the 
solutions  of  both  problems.  Thus,  the  proposed  adaptive  mesh 
generation  algorithms  address  the  interdependence  between 


the  solutions  of  the  forward  and  inverse  problems  and  take 
into  account  the  orientation  of  the  source -detectors  and  the  ab¬ 
sorptive  perturbations.  This  makes  the  adaptive  discretization 
algorithms  presented  in  this  paper  different  from  the  previous 
approaches  [4],  [9],  [3],  [7],  [8]  (see  [5],  [6]  for  an  extensive 
literature  survey).  The  simulation  experiments  validate  the 
implications  of  our  error  analysis  and  show  that  the  proposed 
mesh  generation  algorithms  significantly  improve  the  accuracy 
of  the  reconstructed  optical  images  for  a  given  number  of 
unknowns  in  the  discrete  forward  and  inverse  problems. 

The  outline  of  this  paper  is  as  follows:  In  Section  2, 
we  give  a  brief  overview  of  the  forward  and  inverse  DOT 
problems  and  recall  the  two  theorems  presented  in  [5]  which 
summarize  the  impact  of  discretization  on  the  accuracy  of  the 
reconstructed  optical  images.  Finally  we  discuss  the  adaptive 
mesh  generation  algorithms  followed  by  our  experimental 
results  and  the  conclusion  section. 


TABLE  I 

Index  of  notation. 


Notation 

Explanation 

ll/WHo 

The  (or  iTu(f2))  norm  of  /(x) 

\\m\\r 

The  norm  of  /(x) 

il/(x)iiz,p(n) 

The  LP(Q)  norm  of  /(x) 

ll/(x)llo,m 

The  L2  (or  H°)  norm  of  /(x)  over  the  mth 

finite  element  f2m 

||/(x)||p,m 

The  Hp  norm  of  /(x)  on  the  mth  finite  element  Qm 

/« 

The  complex  conjugate  of  /(x) 

II.  Overview 

In  this  section,  we  first  briefly  define  the  forward  and  inverse 
problems  in  DOT.  Next,  we  state  Theorems  1  and  2  presented 
in  [5]  to  recall  the  effect  of  the  discretization  of  the  forward 
and  inverse  problems  on  the  accuracy  of  optical  absorption 
image  reconstruction.  We  refer  to  Table  I  for  the  explanation 
of  the  notation  associated  with  functions  and  their  norms. 

A.  Forward  and  inverse  problems  in  DOT 

We  consider  the  following  boundary  value  problem  to  model 
the  near  infrared  light  propagation  in  a  bounded  domain  O  C 


R3,  with  Lipschitz  boundary  [2]: 

-V  •  D(x)Vgj(\)  +  (a ta(x)  +  =  Qj(x)  x  <=  Cl,  (1) 

ffj(x)  +  2 aZ?(x)^p-(x)  =  0  x  €  91],  (2) 

where  </j(x)  is  the  photon  density,  Qj(x)  is  the  point  source 
located  at  the  source  position  xj,  l)(x)  is  the  diffusion  coeffi¬ 
cient,  /ra(x)  is  the  absorption  coefficient,  u>  is  the  modulation 
frequency  of  the  source,  c  is  the  speed  of  the  light,  a  is  a 
parameter  governing  the  internal  reflection  at  the  boundary 
<911,  and  d  ■  /dn  denotes  the  directional  derivative  along 
the  unit  normal  vector  n  on  the  boundary.  The  boundary 
value  problem  (l)-(2)  constitutes  the  forward  problem  in  DOT 
together  with  the  associated  adjoint  problem  [1],  [5]: 

-V-  D(x)Xg*(x)  +  (pa(x)  -  —  )g*(x)  =  0  xg  Cl,  (3) 

c 

9*  (x)  +  2 aD  (x)  ^-(x)  =  Q*(x)  x  G  dCl ,  (4) 

where  Q*(x)  is  the  adjoint  source  located  at  the  detector 
position  xld.  Note  that  we  approximate  the  point  source  Qj 
in  (1)  and  the  adjoint  source  Q*  in  (4)  by  Gaussian  functions 
with  sufficiently  low  variance,  whose  centers  are  located  at  xJs, 
and  xld,  respectively  [5], 

In  this  work,  we  focus  on  the  estimation  of  the  absorp¬ 
tion  coefficient  and  consider  an  iterative  algorithm  based 
on  repetitive  linearization  of  the  inverse  problem  using  first 
order  Born  approximation.  Using  a  zeroth  order  Tikhonov 
regularization  to  address  the  illposedness,  the  inverse  problem 
at  each  iteration  reads: 

7(x)  =  A*aTij  =  (A*aAa  +  XI)ax  =  Kax,  (5) 
=  /  K(x,x)ax(x)dx  +  AaA(x)  (6) 

J  n 

where  f’jj  is  the  differential  measurement  at  the  ith  detector 
due  to  the  jth  source  as  a  result  of  the  small  perturbation  a(x) 
on  the  background  absorption  coefficient  /i0(x).  In  (6),  k(x,x) 
is  the  kernel  of  the  integral  equation,  given  by 

Nd,Ns 

«(M)  =  <7) 

where  H(xJs,xd-,x)  =  —g*(x,xld)g(x1xJs)  is  the  kernel  of 
the  integral  operator  Aa  :  L°°(Cl)  — »  (jNdxNs  an(j 

H*(x;xJs,xd)  =  —g*(x,xld)g(x,xJs)  is  the  kernel  of  the  adjoint 
operator  A*  :  CNdXN,s  — >  L1(Cl)  [6].  For  the  rest  of  the  paper, 
we  will  denote  L°°(Cl)  and  L1( Cl)  by  X  and  Y,  respectively. 

Below  we  summarize  the  two  theorems  of  [5]  and  provide 
the  error  estimates  which  will  be  used  in  the  design  of  adaptive 
meshes  for  the  discretization  of  the  forward  and  inverse  DOT 
problems. 

B.  Effect  of  inverse  problem  discretization 

Consider  the  discretization  of  the  inverse  problem  (5)  by 
projecting  it  onto  a  finite  dimensional  subspace  Yn,  using  the 


collocation  method  [5]: 

P„Kax  =  Pn  7,  (8) 

where  P„  :  Y  —>  Yn  is  the  projection  operator  associated  with 
the  collocation  method  with  piecewise  linear  Lagrange  basis 
functions  Lp(x)  [5]  such  that  a„(x),  x  G  Cl,  is  approximated 
on  a  set  {flm}  of  finite  elements  for  m  =  1,---,7Va, 
|J^A  Clm  =  Cl  as  follows: 

n 

«n(x)  =  ^apLp(x).  (9) 

p=  i 

Let  f>  be  the  interpolant  of  ax  [2]  and  assume  that  ax  G 
H1( Cl).  Then,  the  interpolation  error  ea  =  ax  —  ip  is  bounded 
by 

||ea||o,m  <  C||aA||iim/j.m,  (10) 

where  C  is  a  positive  constant  and  hm  is  the  diameter  of  the 
smallest  ball  that  contains  the  mth  element.  Then, 

Theorem  1: 

The  error  between  the  solution  ax  of  (5)  and  the  solution 
ax  of  (8)  is  bounded  by  ||aA  -  aA||ii(n): 

_  JVa 

<  Cy/Vh\\I-Tn\\Y^Xn'£\\at\\1,mhm 

m=  1 

c 

+  jll^nll Y^xn  max  \\g*gj\\Li 

N&  Nd,Ns 

x  'y  ^  y  ]  1 1 9i  9j  1 1  o,?7i  1 1  tr  \\i,mhm, 

m=  1  i,j 

where  C  is  a  positive  constant,  Vq  is  the  volume  of  Cl,  Tn  : 
Y  — >  Xn  is  a  uniformly  bounded  operator  given  by  Tn  =  [/  + 
\PnA*a  Aa]~1Pn,  and  gj,g*  are  the  solutions  of  the  variational 
formulations  of  the  boundary  value  problems  (l)-(2)  and  (3)- 
(4),  respectively  [5]. 

Proof:  See  [5]. 

C.  Effect  of  forward  problem  discretization 
Assume  that  D(x),  pa(x)  G  C2(Cl).  Noting  that  Qj,Q*  G 
H1( Cl),  the  solutions  g3 ,  g"  of  the  variational  formulations  of 
the  boundary  value  problems  (l)-(2)  and  (3)-(4)  satisfy  [5] 

gffx),g*(x)GC\n).  (11) 

Let  Gj(x)  and  G*(x)  be  the  finite  element  approximations  to 
gj  and  g*,  respectively,  and  let  h?m  and  hlm  be  the  diameter 
of  the  smallest  ball  that  contains  the  mth  element  in  the  finite 
element  solutions  Gj  and  G*,  respectively.  Then,  a  bound  for 
the  discretization  error  in  the  solutions  Gj  and  G*  can  be 
given  by 

II 9j  ~  Gj\\otm  <  C\\gj\\itmhJm,  (12) 

M-G*b,m<C\\gt\\hmK%,  (13) 

where  C  is  a  positive  constant. 

Consider  the  inverse  problem 

Kax  =  7, 


(14) 


where  K  and  7  are  the  finite  dimensional  approximations  to 
K  and  7,  obtained  by  substituting  gj  and  g*  in  H(xJs,xld;x) 
and  H*(x-xJs,xld)  by  Gj  and  G*,  respectively.  Then, 

Theorem  2: 

A  bound  for  the  error  between  the  solution  ax  of  (5)  and 
the  solution  ax  of  (14)  due  to  approximations  K  and  7  is 
given  by: 

C 

||aA -aA||Li(n)  <  -  max  115*^1^1(0) 

(At A  Nd,Na 

E  E  (  2ll5jaA||o,m  +  llalloollsjllo.m  )  \\g*\\i,mhlm 

m— 1  i,j 


(a)  The  optical  domain  and 
source-detector  configuration  for 
the  simulation  study. 


(b)  The  solid,  square,  and  dia¬ 
mond  lines  correspond  to  the  cross- 
sectional  cuts  taken  from  the  im¬ 
ages  shown  in  Figures  3(a),  3(d), 
and  3(b),  respectively. 


Na  Nd,Ns 

+  E  E  (  2\\g>x\\0: 

771= 1  i,j 


M\ 


0  ,ra 


)  115,11 


Fig.  1 .  (a)  The  setup  used  for  the  simulation  study.  The  squares  and  triangles 
denote  the  detectors  and  sources,  respectively,  (b)  The  cross-sectional  cuts 
taken  from  Figures  3(a),  3(b),  and  3(d),  along  x  direction  at  y  =  3. 


where  C  is  a  positive  constant. 

Proof:  See  [5]. 

We  refer  to  [5]  for  further  details  of  the  discussion  regarding 
the  definition  and  discretization  of  the  forward  and  inverse 
problems. 

III.  Adaptive  Mesh  Generation 

The  adaptive  mesh  generation  is  based  on  minimizing  the 
error  bounds  in  Theorems  1  and  2  and  distributing  the  error 
bound  evenly  on  each  of  the  finite  elements.  For  the  details 
of  the  mesh  generation,  we  refer  to  [6]. 


absorption  value.  Then,  for  each  imaging  problem,  we  consider 
three  mesh  scenarios:  Uniform  mesh  for  both  forward  and 
inverse  problems;  adaptive  mesh  for  the  forward  problem  and 
uniform  mesh  for  the  inverse  problem;  and  adaptive  meshes 
for  both  forward  and  inverse  problems.  We  refer  to  Table  II 
for  a  brief  outline  of  the  first  simulation  study. 

TABLE  II 

The  mesh  scenarios  and  the  background  /ia  values  in  the 

SIMULATION  STUDY. 


IV.  Numerical  Experiments 

In  the  following,  we  present  the  results  of  our  numerical 
experiments.  Note  that  in  all  experiments,  we  use  triangular 
finite  elements  with  piecewise  linear  Lagrange  basis  functions. 
We  apply  Gaussian  elimination  method  to  solve  the  discrete 
forward  problem  resulting  from  the  variational  formulation  [2] 
of  the  boundary  value  problems  (l)-(2)  and  (3)-(4)  [5].  For  the 
inverse  problem,  we  consider  the  discrete  problem  obtained  by 
projecting  (14)  by  collocation  method: 

PnKax  =  Pn  7,  (15) 

where  the  regularization  parameter  is  set  to  A  =  10”7,  which 
is  chosen  based  on  experience. 

A.  Simulation  Study 

In  this  simulation  study,  we  consider  the  geometry  shown 
in  Figure  1(a).  We  simulate  the  optical  data  by  solving  the 
diffusion  equation  at  uj  =  0  on  a  fine  uniform  grid  with  61 
nodes  along  x  and  y  directions,  where  the  refractive  index 
mismatch  parameter  a  =  3.  11  sources  and  11  detectors 
are  evenly  spaced  on  the  bottom  and  top  edges  of  the 
square,  respectively.  The  diffusion  coefficient  D(x)  =  0.0410 
is  assumed  to  be  constant.  The  circular  heterogeneity  with 
absorption  coefficient  ya  =  0.2  cm-1  is  embedded  in  an 
optically  homogeneous  background  with  /./„  =  0.04  cm-1. 

In  order  to  obtain  a  series  of  absorption  imaging  problems 
using  the  same  setup,  we  consider  5  values  for  the  background 


Mesh  (Forward)  Mesh  (Inverse)  Background  fia  (cm  I) 

Uniform  Uniform  0.032,  0.036,  0.040,  0.044,  0.050 

Adaptive  Uniform  0.032,  0.036,  0.040.  0.044,  0.050 

Adaptive  Adaptive  0.032,  0.036,  0.040,  0.044,  0.050 

The  uniform  mesh  used  for  the  forward  problem  discretiza¬ 
tion  has  625  nodes  and  is  shown  in  Figure  2(a).  The  uniform 
mesh  for  the  inverse  problem  has  313  nodes  and  is  shown 
in  Figure  2(b).  We  use  the  algorithms  described  in  Section 
3.1  and  Remark  l(iii),  and  Section  3.3  Remark  2(iii)  in  [6] 
to  generate  the  adaptive  meshes  for  the  forward  and  inverse 
problems,  respectively.  The  number  of  nodes  in  each  of  the 
adaptive  meshes  used  for  the  forward  problem  does  not  exceed 
750.  An  example  for  the  adaptive  mesh  generated  for  a  source 
located  at  (1.0,0)  is  shown  in  Figure  2(c).  The  adaptive  mesh 
for  the  inverse  problem  generated  for  the  case  where  the 
background  /r0  =  0.050  cm”1  has  418  nodes  and  is  shown  in 
Figure  2(d). 

We  consider  the  image  reconstructed  by  using  fine  uniform 
meshes  (61  x  61  nodes  for  the  forward  problem  and  61  x  61 
nodes  for  the  inverse  problem)  as  the  reference  image,  which  is 
assumed  to  possess  no  error  due  to  discretization.  We  compute 
the  error  ||aA  —  dA||ii(n)  for  each  image  reconstruction  and 
tabulate  the  results  in  Table  III.  We  see  that  the  error  in 
the  images  reconstructed  by  using  uniform  meshes  for  both 
forward  and  inverse  problems  is  significantly  reduced  by 
the  use  of  adaptively  refined  meshes.  A  similar  behavior  is 
observed  for  all  choices  of  background  absorption  value. 

We  present  image  reconstructions  in  Figures  3  for  the 


, 

(a)  The  uniform  mesh  with  625 
nodes. 


(c)  The  adaptive  mesh  generated  for 
the  forward  problem  for  the  source 
located  at  (1.0,0):  Background  pa  = 
0.050  cm"1. 


(b)  The  uniform  mesh  with  313 
nodes. 


(d)  The  adaptive  mesh  generated  for 
the  inverse  problem  solution,  with 
418  nodes.  Background  jin  =  0.050 


(a)  The  optical  absorption  image 
used  as  the  reference  for  error  com¬ 
putations. 


(c)  The  reconstructed  absorption  im¬ 
age  using  an  adaptive  mesh  for  the 
forward,  and  the  uniform  mesh  in 
Figure  2(b)  for  the  inverse  problem. 


(b)  The  reconstructed  absorption  im¬ 
age  using  the  uniform  mesh  in  Fig¬ 
ure  2(a)  for  the  forward,  and  the 
uniform  mesh  in  Figure  2(b)  for  the 
inverse  problem. 


(d)  The  reconstructed  absorption  im¬ 
age  using  an  adaptive  mesh  for  the 
forward,  and  the  adaptive  mesh  in 
Figure  2(d)  for  the  inverse  problem. 


Fig.  2.  Examples  of  meshes  used  in  the  first  simulation  study. 


TABLE  III 

The  error  ||aA  —  a^\\Li^  for  each  experiment  described  in  the 

SIMULATION  STUDY  AND  TABLE  II.  THE  FIRST  COLUMN  SHOWS  THE  TYPE 
OF  THE  MESHES  (“U”  FOR  UNIFORM,  “A”  FOR  ADAPTIVE)  USED  IN  THE 
FORWARD  AND  INVERSE  PROBLEMS,  RESPECTIVELY. 


Background  pa'- 

0.032 

0.036 

0.040 

0.044 

0.050 

u-u 

n« 

- 

lx< n)  : 

0.233 

0.256 

0.277 

0.293 

0.301 

A-U 

|Q!A 

l1) n)  : 

0.124 

0.114 

0.117 

0.121 

0.128 

A-A 

ll«A 

zA( n)  : 

0.104 

0.099 

0.099 

0.100 

0.101 

case,  in  which  the  background  absorption  value  is  equal  to 
0.050  cm-1.  Figure  3(a)  displays  the  reference  image  used 
to  compute  the  corresponding  error  values  given  in  Table  III. 
Figures  3(c)  and  3(d)  show  that  the  optical  heterogeneity  is 
resolved  better  by  using  adaptive  meshes  as  compared  to  the 
reconstructed  image  obtained  by  using  uniform  meshes,  which 
is  shown  in  Figure  3(b).  These  results  are  consistent  with  the 
error  values  given  in  Table  III.  Note  that  the  number  of  nodes 
in  the  adaptive  meshes  is  almost  equal  to  the  number  of  nodes 
that  the  uniform  meshes  have.  In  Figure  1(b),  we  show  the 
cross-sectional  views  from  the  reconstructed  images. 

V.  Conclusion 

In  this  work,  based  on  the  error  analysis  presented  in  [5], 
we  presented  a  verification  of  Theorems  1  and  2  and  showed 
that  the  proposed  mesh  generation  algorithms  significantly 
improve  the  accuracy  of  the  reconstructed  optical  images  for  a 
given  number  of  unknowns  in  the  discrete  forward  and  inverse 
problems.  Conventional  error  estimates  do  not  include  domain 
specific  factors.  As  a  result,  the  adaptive  mesh  generation 
algorithms  based  on  conventional  error  estimates  (12)-(13) 


Fig.  3.  The  results  of  the  simulation  study,  with  the  background  pa  =  0.050 


and  (10)  may  lead  to  high  errors  in  reconstructed  optical 
images  (see  [6]). 

We  finally  note  that  the  adaptive  mesh  generation  algo¬ 
rithms  presented  in  this  paper  can  be  adapted  for  the  forward 
and  inverse  problems  of  similar  inverse  parameter  estimation 
problems,  such  as  electrical  impedance  tomography,  optical 
fluorescence  tomography,  bioluminescence  tomography,  mi¬ 
crowave  imaging  etc. 
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